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PREFACE 


This is a text for the basic graduate sequence in abstract algebra, offered by most 
universities. We study fundamental algebraic structures, namely groups, rings, fields and 
modules, and maps between these structures. The techniques are used in many areas of 
mathematics, and there are applications to physics, engineering and computer science as 
well. In addition, I have attempted to communicate the intrinsic beauty of the subject. 
Ideally, the reasoning underlying each step of a proof should be completely clear, but the 
overall argument should be as brief as possible, allowing a sharp overview of the result. 
These two requirements are in opposition, and it is my job as expositor to try to resolve 
the conflict. 

My primary goal is to help the reader learn the subject, and there are times when 
informal or intuitive reasoning leads to greater understanding than a formal proof. In the 
text, there are three types of informal arguments: 


1. The concrete or numerical example with all features of the general case. Here, the 
example indicates how the proof should go, and the formalization amounts to substi- 
tuting Greek letters for numbers. There is no essential loss of rigor in the informal 
version. 


2. Brief informal surveys of large areas. There are two of these, p-adic numbers and 
group representation theory. References are given to books accessible to the beginning 
graduate student. 


3. Intuitive arguments that replace lengthy formal proofs which do not reveal why a 
result is true. In this case, explicit references to a precise formalization are given. I am 
not saying that the formal proof should be avoided, just that the basic graduate year, 
where there are many pressing matters to cope with, may not be the appropriate place, 
especially when the result rather than the proof technique is used in applications. 


I would estimate that about 90 percent of the text is written in conventional style, 
and I hope that the book will be used as a classroom text as well as a supplementary 
reference. 

Solutions to all problems are included in the text; in my experience, most students 
find this to be a valuable feature. The writing style for the solutions is similar to that 
of the main text, and this allows for wider coverage as well as reinforcement of the basic 
ideas. 

Chapters 1-4 cover basic properties of groups, rings, fields and modules. The typi- 
cal student will have seen some but not all of this material in an undergraduate algebra 
course. [It should be possible to base an undergraduate course on Chapters 1-4, traversed 
at a suitable pace with detailed coverage of the exercises.] In Chapter 4, the fundamental 
structure theorems for finitely generated modules over a principal ideal domain are de- 
veloped concretely with the aid of the Smith normal form. Students will undoubtedly be 


comfortable with elementary row and column operations, and this will significantly aid 
the learning process. 

In Chapter 5, the theme of groups acting on sets leads to a nice application to com- 
binatorics as well as the fundamental Sylow theorems and some results on simple groups. 
Analysis of normal and subnormal series leads to the Jordan-Hélder theorem and to solv- 
able and nilpotent groups. The final section, on defining a group by generators and 
relations, concentrates on practical cases where the structure of a group can be deduced 
from its presentation. Simplicity of the alternating groups and semidirect products are 
covered in the exercises. 

Chapter 6 goes quickly to the fundamental theorem of Galois theory; this is possible 
because the necessary background has been covered in Chapter 3. After some examples 
of direct calculation of a Galois group, we proceed to finite fields, which are of great 
importance in applications, and cyclotomic fields, which are fundamental in algebraic 
number theory. The Galois group of a cubic is treated in detail, and the quartic is 
covered in an appendix. Sections on cyclic and Kummer extensions are followed by Galois’ 
fundamental theorem on solvability by radicals. The last section of the chapter deals with 
transcendental extensions and transcendence bases. 

In the remaining chapters, we begin to apply the results and methods of abstract 
algebra to related areas. The title of each chapter begins with “Introducing... ”, and the 
areas to be introduced are algebraic number theory, algebraic geometry, noncommutative 
algebra and homological algebra (including categories and functors). 

Algebraic number theory and algebraic geometry are the two major areas that use the 
tools of commutative algebra (the theory of commutative rings). In Chapter 7, after an 
example showing how algebra can be applied in number theory, we assemble some algebraic 
equipment: integral extensions, norms, traces, discriminants, Noetherian and Artinian 
modules and rings. We then prove the fundamental theorem on unique factorization of 
ideals in a Dedekind domain. The chapter concludes with an informal introduction to 
p-adic numbers and some ideas from valuation theory. 

Chapter 8 begins geometrically with varieties in affine space. This provides moti- 
vation for Hilbert’s fundamental theorems, the basis theorem and the Nullstellensatz. 
Several equivalent versions of the Nullstellensatz are given, as well as some corollaries 
with geometric significance. Further geometric considerations lead to the useful algebraic 
techniques of localization and primary decomposition. The remainder of the chapter is 
concerned with the tensor product and its basic properties. 

Chapter 9 begins the study of noncommutative rings and their modules. The basic 
theory of simple and semisimple rings and modules, along with Schur’s lemma and Ja- 
cobson’s theorem, combine to yield Wedderburn’s theorem on the structure of semisimple 
rings. We indicate the precise connection between the two popular definitions of simple 
ring in the literature. After an informal introduction to group representations, Maschke’s 
theorem on semisimplicity of modules over the group algebra is proved. The introduction 
of the Jacobson radical gives more insight into the structure of rings and modules. The 
chapter ends with the Hopkins-Levitzki theorem that an Artinian ring is Noetherian, and 
the useful lemma of Nakayama. 

In Chapter 10, we introduce some of the tools of homological algebra. Waiting until 
the last chapter for this is a deliberate decision. Students need as much exposure as 
possible to specific algebraic systems before they can appreciate the broad viewpoint of 


category theory. Even experienced students may have difficulty absorbing the abstract 
definitions of kernel, cokernel, product, coproduct, direct and inverse limit. To aid the 
reader, functors are introduced via the familiar examples of hom and tensor. No attempt 
is made to work with general abelian categories. Instead, we stay within the category of 
modules and study projective, injective and flat modules. 

In a supplement, we go much farther into homological algebra than is usual in the basic 
algebra sequence. We do this to help students cope with the massive formal machinery 
that makes it so difficult to gain a working knowledge of this area. We concentrate on 
the results that are most useful in applications: the long exact homology sequence and 
the properties of the derived functors Tor and Ext. There is a complete proof of the 
snake lemma, a rarity in the literature. In this case, going through a long formal proof is 
entirely appropriate, because doing so will help improve algebraic skills. The point is not 
to avoid difficulties, but to make most efficient use of the finite amount of time available. 


Robert B. Ash 
October 2000 


Further Remarks 


Many mathematicians believe that formalism aids understanding, but I believe that 
when one is learning a subject, formalism often prevents understanding. The most im- 
portant skill is the ability to think intuitively. This is true even in a highly abstract field 
such as homological algebra. My writing style reflects this view. 

Classroom lectures are inherently inefficient. If the pace is slow enough to allow 
comprehension as the lecture is delivered, then very little can be covered. If the pace 
is fast enough to allow decent coverage, there will unavoidably be large gaps. Thus 
the student must depend on the textbook, and the current trend in algebra is to produce 
massive encyclopedias, which are likely to be quite discouraging to the beginning graduate 
student. Instead, I have attempted to write a text of manageable size, which can be read 
by students, including those working independently. 

Another goal is to help the student reach an advanced level as quickly and efficiently as 
possible. When I omit a lengthy formal argument, it is because I judge that the increase 
in algebraic skills is insufficient to justify the time and effort involved in going through 
the formal proof. In all cases, I give explicit references where the details can be found. 
One can argue that learning to write formal proofs is an essential part of the student’s 
mathematical training. I agree, but the ability to think intuitively is fundamental and 
must come first. I would add that the way things are today, there is absolutely no danger 
that the student will be insufficiently exposed to formalism and abstraction. In fact there 
is quite a bit of it in this book, although not 100 percent. 

I offer this text in the hope that it will make the student’s trip through algebra more 
enjoyable. I have done my best to avoid gaps in the reasoning. I never use the phrase 
“it is easy to see” under any circumstances. I welcome comments and suggestions for 
improvement. 

Copyright © 2000, by Robert B. Ash 

Paper or electronic copies for noncommercial use may be made freely without explicit 
permission of the author. All other rights are reserved. 
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Chapter 0 


Prerequisites 


All topics listed in this chapter are covered in A Primer of Abstract Mathematics by 
Robert B. Ash, MAA 1998. 


0.1 Elementary Number Theory 
The greatest common divisor of two integers can be found by the Euclidean algorithm, 


which is reviewed in the exercises in Section 2.5. Among the important consequences of 
the algorithm are the following three results. 


0.1.1 


If dis the greatest common divisor of a and b, then there are integers s and t such that 
sa+tb=d. In particular, if a and D are relatively prime, there are integers s and t such 
that sa+ tb = 1. 


0.1.2 


If a prime p divides a product a,---a@, of integers, then p divides at least one a; 


0.1.3. Unique Factorization Theorem 


If a is an integer, not 0 or +1, then 


(1) a can be written as a product p1---p, of primes. 


(2) If a = pi---Pn = %1-*- Gm, Where the p; and q; are prime, then n = m and, after 
renumbering, p; = +q; for all 7. 


[We allow negative primes, so that, for example, —17 is prime. This is consistent with the 
general definition of prime element in an integral domain; see Section 2.6.] 
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0.1.4 The Integers Modulo m 


If a and b are integers and m is a positive integer > 2, we write a = b mod m, and say 
that a is congruent to b modulo m, if a — b is divisible by m. Congruence modulo m 
is an equivalence relation, and the resulting equivalence classes are called residue classes 
mod m. Residue classes can be added, subtracted and multiplied consistently by choosing 
a representative from each class, performing the appropriate operation, and calculating 
the residue class of the result. The collection Z,, of residue classes mod m forms a 
commutative ring under addition and multiplication. Z,, is a field if and only if m is 
prime. (The general definitions of ring, integral domain and field are given in Section 2.1.) 


0.1.5 

(1) The integer a is relatively prime to m if and only if a is a unit mod m, that is, a has 
a multiplicative inverse mod m. 

If c divides ab and a and c are relatively prime, then c divides b. 

If a and 0 are relatively prime to m, then ab is relatively prime to m. 

If az = ay mod m and a is relatively prime to m, then x = y mod m. 

If d = gcd(a, b), the greatest common divisor of a and b, then a/d and b/d are relatively 

prime. 


(6) If ax = ay mod m and d= gcd(a,m), then « = y mod m/d. 


7) If a; divides b for7 = 1,...,r, and a; and a, are relatively prime whenever i 4 7, then 
j y Jj 
the product a,---a, divides b. 
(8) The product of two integers is their greatest common divisor times their least common 
multiple. 


0.1.6 Chinese Remainder Theorem 


If m ,,...,m, are relatively prime in pairs, then the system of simultaneous equations 
x = 6b; mod m;,j =1,...,7r, has a solution for arbitrary integers b;. The set of solutions 
forms a single residue class mod m=m, ---m,, so that there is a unique solution mod m. 

This result can be derived from the abstract form of the Chinese remainder theorem; 
see Section 2.3. 


0.1.7 Euler’s Theorem 


The Euler phi function is defined by y(n) = the number of integers in {1,...,n} that 
are relatively prime to n. For an explicit formula for y(n), see Section 1.1, Problem 13. 
Euler’s theorem states that if n > 2 and a is relatively prime to n, then a?(”) = 1 mod n. 


0.1.8 Fermat’s Little Theorem 


If a is any integer and p is a prime not dividing a, then a?~' = 1 mod p. Thus for any 
integer a and prime p, whether or not p divides a, we have a? = a mod p. 
For proofs of (0.1.7) and (0.1.8), see (1.3.4). 
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0.2 Set Theory 


0.2.1 


A partial ordering on a set S is a relation on S' that is reflexive (x < x for all x € S), 
antisymmetric (a < y and y < x implies x = y), and transitive (a < y and y < z implies 
x <z). If for all z,y € S, either x < y or y < a, the ordering is total. 


0.2.2 


A well-ordering on S is a partial ordering such that every nonempty subset A of S has a 
smallest element a. (Thus a < b for every b € A). 


0.2.3 Well-Ordering Principle 


Every set can be well-ordered. 


0.2.4 Maximum Principle 


If T is any chain (totally ordered subset) of a partially ordered set S, then T is contained 
in a maximal chain M. (Maximal means that M is not properly contained in a larger 
chain.) 


0.2.5 Zorn’s Lemma 


If S is a nonempty partially ordered set such that every chain of S has an upper bound 
in S, then S$ has a maximal element. 

(The element x is an upper bound of the set A if a < x for every a € A. Note that 
x need not belong to A, but in the statement of Zorn’s lemma, we require that if A is a 
chain of S$, then A has an upper bound that actually belongs to S.) 


0.2.6 Axiom of Choice 


Given any family of nonempty sets $;, 7 € I, we can choose an element of each Sj. 
Formally, there is a function f whose domain is I such that f(i) € 5; for alli € I. 

The well-ordering principle, the maximum principle, Zorn’s lemma, and the axiom of 
choice are equivalent in the sense that if any one of these statements is added to the basic 
axioms of set theory, all the others can be proved. The statements themselves cannot be 
proved from the basic axioms. Constructivist mathematics rejects the axiom of choice 
and its equivalents. In this philosophy, an assertion that we can choose an element from 
each S$; must be accompanied by an explicit algorithm. The idea is appealing, but its 
acceptance results in large areas of interesting and useful mathematics being tossed onto 
the scrap heap. So at present, the mathematical mainstream embraces the axiom of 
choice, Zorn’s lemma et al. 
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0.2.7 Proof by Transfinite Induction 


To prove that statement P; holds for all 7 in the well-ordered set I, we do the following: 
1. Prove the basis step Po, where 0 is the smallest element of I. 
2. If i > 0 and we assume that P; holds for all j < i (the transfinite induction 
hypothesis), prove P;. 
It follows that P; is true for all 7. 


0.2.8 


We say that the size of the set A is less than or equal to the size of B (notation A <, B) if 
there is an injective map from A to B. We say that A and B have the same size (A =, B) 
if there is a bijection between A and B. 


0.2.9 Schrodder-Bernstein Theorem 
If A<, Band B<, A, then A=, B. (This can be proved without the axiom of choice.) 


0.2.10 


Using (0.2.9), one can show that if sets of the same size are called equivalent, then <, 
on equivalence classes is a partial ordering. It follows with the aid of Zorn’s lemma that 
the ordering is total. The equivalence class of a set A, written |A|, is called the cardinal 
number or cardinality of A. In practice, we usually identify |A| with any convenient 
member of the equivalence class, such as A itself. 


0.2.11 


For any set A, we can always produce a set of greater cardinality, namely the power set 24, 
that is, the collection of all subsets of A. 


0.2.12 


Define addition and multiplication of cardinal numbers by |A|+|B] = |AUB| and |A||B| = 
|A x B|. In defining addition, we assume that A and B are disjoint. (They can always be 
disjointized by replacing a € A by (a,0) and b € B by (8, 1).) 


0.2.13 


If No is the cardinal number of a countably infinite set, then Ng + No = NoXo = No. More 
generally, 


(a) If @ and @ are cardinals, with a < @ and @ infinite, then a+ 6 = £. 
(b) If a0 (ie., a is nonempty), a < 6 and @ is infinite, then af = £. 


0.2.14 


If A is an infinite set, then A and the set of all finite subsets of A have the same cardinality. 
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0.3. Linear Algebra 


It is not feasible to list all results presented in an undergraduate course in linear algebra. 
Instead, here is a list of topics that are covered in a typical course. 
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11. 


Sums, products, transposes, inverses of matrices; symmetric matrices. 
Elementary row and column operations; reduction to echelon form. 
Determinants: evaluation by Laplace expansion and Cramer’s rule. 
Vector spaces over a field; subspaces, linear independence and bases. 
Rank of a matrix; homogeneous and nonhomogeneous linear equations. 
Null space and range of a matrix; the dimension theorem. 

Linear transformations and their representation by matrices. 
Coordinates and matrices under change of basis. 

Inner product spaces and the projection theorem. 


Eigenvalues and eigenvectors; diagonalization of matrices with distinct eigenvalues, 
symmetric and Hermitian matrices. 


Quadratic forms. 


A more advanced course might cover the following topics: 


12. 
13. 
14. 
15. 
16. 


Generalized eigenvectors and the Jordan canonical form. 

The minimal and characteristic polynomials of a matrix; Cayley-Hamilton theorem. 
The adjoint of a linear operator. 

Projection operators. 


Normal operators and the spectral theorem. 
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Chapter 1 


Group Fundamentals 


1.1 Groups and Subgroups 


1.1.1 Definition 


A group is a nonempty set G on which there is defined a binary operation (a,b) — ab 
satisfying the following properties. 


Closure: If a and b belong to G, then ab is also in G; 
Associativity: a(bc) = (ab)c for all a,b,c € G; 


Identity: There is an element 1 € G such that al = la = a for all a in G; 
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Inverse: If a is in G, then there is an element a~! in G such that aa~! = a~'a = 1. 


A group G is abelian if the binary operation is commutative, i.e., ab = ba for all a,b 
in G. In this case the binary operation is often written additively ((a,b) — a+ 6), with 
the identity written as 0 rather than 1. 

There are some very familiar examples of abelian groups under addition, namely the 
integers Z, the rationals Q, the real numbers R, the complex numers C, and the integers 
Zm, modulo m. Nonabelian groups will begin to appear in the next section. 

The associative law generalizes to products of any finite number of elements, for exam- 
ple, (ab) (cde) = a(bcd)e. A formal proof can be given by induction. If two people A and 
B form a,---@, in different ways, the last multiplication performed by A might look like 
(a1 +++ @;)(@i41°-+@n), and the last multiplication by B might be (a1 ---a@;)(@j41--- an). 
But if (without loss of generality) i < j, then (induction hypothesis) 


(a1 +++ aj) = (a1 +++ i) (@i41 +++ @;) 
and 
(Qi41 +++ Qn) = (@i4z1 +++ Gj) (Qj41°++ Gn). 


By the n = 3 case, i.e., the associative law as stated in the definition of a group, the 
products computed by A and B are the same. 
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The identity is unique (1’ = 1/1 = 1), as is the inverse of any given element (if b and b’ 
are inverses of a, then b = 1b = (0/a)b = b'(ab) = b'1 = Db’). Exactly the same argument 
shows that if b is a right inverse, and 0’ a left inverse, of a, then b= 0b’. 


1.1.2 Definitions and Comments 


A subgroup H of a group G is a nonempty subset of G that forms a group under the 
binary operation of G. Equivalently, H is a nonempty subset of G such that if a and b 
belong to H, so does ab~!. (Note that 1 = aa~! € H; also, ab= a((b-+)~") € H.) 

If A is any subset of a group G, the subgroup generated by A is the smallest subgroup 
containing A, often denoted by (A). Formally, (A) is the intersection of all subgroups 
containing A. More explicitly, (A) consists of all finite products a1---an,, n = 1,2,..., 
where for each 7, either a; or a belongs to A. To see this, note that all such products 
belong to any subgroup containing A, and the collection of all such products forms a 
subgroup. In checking that the inverse of an element of (A) also belongs to (A), we use 
the fact that 


(A+++ Qn) =a, 7 Qy 


n 


1.1.3 Definitions and Comments 


The groups G, and G2 are said to be isomorphic if there is a bijection f: G; — G2 that 
preserves the group operation, in other words, f(ab) = f(a) f(b). Isomorphic groups are 
essentially the same; they differ only notationally. Here is a simple example. A group G 
is cyclic if G is generated by a single element: G = (a). A finite cyclic group generated 
by a is necessarily abelian, and can be written as {1,a,a7,...,a"~1} where a” = 1, or in 
additive notation, {0,a,2a,...,(n — 1)a}, with na = 0. Thus a finite cyclic group with 
n elements is isomorphic to the additive group Z,, of integers modulo n. Similarly, if G 
is an infinite cyclic group generated by a, then G must be abelian and can be written as 
{1,a*!,a*?,...}, or in additive notation as {0, +a, +2a,...}. In this case, G is isomorphic 
to the additive group Z of all integers. 

The order of an element a in a group G (denoted |a]) is the least positive integer n such 
that a” = 1; if no such integer exists, the order of a is infinite. Thus if |a] = n, then the 
cyclic subgroup (a) generated by a has exactly n elements, and a’ = 1 iff k is a multiple 
of n. (Concrete examples are more illuminating than formal proofs here. Start with 0 in 
the integers modulo 4, and continually add 1; the result is 0,1, 2,3,0,1,2,3,0,1,2,3,....) 

The order of the group G, denoted by |G], is simply the number of elements in G. 


1.1.4 Proposition 


If G is a finite cyclic group of order n, then G has exactly one (necessarily cyclic) subgroup 
of order n/d for each positive divisor d of n, and G has no other subgroups. If G is an infi- 
nite cyclic group, the (necessarily cyclic) subgroups of G are of the form {1, b*1, b*?,...}, 
where 0 is an arbitrary element of G, or, in additive notation, {0, +b, +2b,...}. 
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Proof. Again, an informal argument is helpful. Suppose that H is a subgroup of Zo (the 
integers with addition modulo 20). If the smallest positive integer in H is 6 (a non-divisor 
of 20) then H contains 6,12, 18,4 (oops, a contradiction, 6 is supposed to be the smallest 
positive integer). On the other hand, if the smallest positive integer in H is 4, then H = 
{4,8,12,16,0}. Similarly, if the smallest positive integer in a subgroup H of the additive 
group of integers Z is 5, then H = {0,+5,+10,+15,+20,...}. & 


If G = {1,a,...,a"~1} is a cyclic group of order n, when will an element a” also have 
order n? To discover the answer, let’s work in Z,2. Does 8 have order 12? We compute 
8,16, 24 (= 0), so the order of 8 is 3. But if we try 7, we get 7,14,21,...,77,84=7 x 12, 
so 7 does have order 12. The point is that the least common multiple of 7 and 12 is 
simply the product, while the lcm of 8 and 12 is smaller than the product. Equivalently, 
the greatest common divisor of 7 and 12 is 1, while the gcd of 8 and 12 is 4 > 1. We have 
the following result. 


1.1.5 Proposition 


If G is acyclic group of order n generated by a, the following conditions are equivalent: 


(a) ja] =n. 
(b) r and n are relatively prime. 


(c) ris a unit mod n, in other words, r has an inverse mod n (an integer s such that 
rs =1mod n). 


Furthermore, the set U, of units mod n forms a group under multiplication. The order 
of this group is y(n) = the number of positive integers less than or equal to n that are 
relatively prime to n; ¢ is the familiar Euler y function. 


Proof. The equivalence of (a) and (b) follows from the discussion before the statement 
of the proposition, and the equivalence of (b) and (c) is handled by a similar argument. 
For example, since there are 12 distinct multiples of 7 mod 12, one of them must be 1; 
specifically, 7 x 7 = 1 mod 12. But since 8 x 3 is 0 mod 12, no multiple of 8 can be 
1 mod 12. (If 82 = 1, multiply by 3 to reach a contradiction.) Finally, U, is a group 
under multiplication because the product of two integers relatively prime to n is also 
relatively prime ton. d& 


Problems For Section 1.1 


1. A semigroup is a nonempty set with a binary operation satisfying closure and asso- 
ciativity (we drop the identity and inverse properties from the definition of a group). 
A monoid is a semigroup with identity (so that only the inverse property is dropped). 
Give an example of a monoid that is not a group, and an example of a semigroup 
that is not a monoid. 


2. In Ze, the group of integers modulo 6, find the order of each element. 


3. List all subgroups of Ze. 


10. 


11. 


12. 


13. 


14. 


15. 
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. Let S be the set of all n by n matrices with real entries. Does S form a group under 


matrix addition? 


. Let S* be the set of all nonzero n by n matrices with real entries. Does S* form a 


group under matrix multiplication? 


. If H is a subgroup of the integers Z and H 4 {0}, what does H look like? 


. Give an example of an infinite group that has a nontrivial finite subgroup (trivial 


means consisting of the identity alone). 


. Let a and b belong to the group G. If ab = ba and |a| = m, |b| = n, where m and n 


are relatively prime, show that |ab| = mn and that (a)  (b) = {1}. 


. If Gis a finite abelian group, show that G has an element g such that |g| is the least 


common multiple of {|a|: a € G}. 


Show that a group G cannot be the union of two proper subgroups, in other words, if 
G = HUK where H and K are subgroups of G, then H = G or K = G. Equivalently, 
if H and K are subgroups of a group G, then H U K cannot be a subgroup unless 
HACKork Cd. 


In an arbitrary group, let a have finite order n, and let k be a positive integer. If 
(n, k) is the greatest common divisor of n and k, and [n, k] the least common multiple, 
show that the order of a” is n/(n,k) = [n, k]/k. 


Suppose that the prime factorization of the positive integer n is 
7 = py" py? ++ py” 


and let A; be the set of all positive integers m € {1,2,...,n} such that p; divides m. 
Show that if |.S'| is the number of elements in the set S, then 


n 
| Ai 4 
Pi 
JANA) =—— forié J, 
PiPj 
|A; M Aj M Ax | = for a4; k distinct, 
iPjPk 


and so on. 


Continuing Problem 12, show that the number of positive integers less than or equal 
to n that are relatively prime to n is 


Give an example of a finite group G (of order at least 3) with the property that the 
only subgroups of G are {1} and G itself. 


Does an infinite group with the property of Problem 14 exist? 
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1.2 Permutation Groups 


1.2.1 Definition 


A permutation of a set S is a bijection on S, that is, a function 7: S — S that is one- 
to-one and onto. (If S$ is finite, then 7 is one-to-one if and only if it is onto.) If S is not 
too large, it is feasible to describe a permutation by listing the elements x € S' and the 
corresponding values 7(a). For example, if S = {1,2,3,4,5}, then 


1 2 3 4 5 

3.5 4 1 2 
is the permutation such that (1) = 3, 7(2) = 5, (3) = 4, 7(4) = 1, 7(5) = 2. If we 
start with any element « € S and apply a repeatedly to obtain (x), a(a(x)), m(m(a(2))), 
and so on, eventually we must return to x, and there are no repetitions along the way 


because 7 is one-to-one. For the above example, we obtain 1 — 3 —~ 4 1,2 —~5 — 2. 
We express this result by writing 


mw = (1,3,4)(2,5) 


where the cycle (1,3,4) is the permutation of S that maps 1 to 3, 3 to 4 and 4 to 1, 
leaving the remaining elements 2 and 5 fixed. Similarly, (2,5) maps 2 to 5, 5 to 2, 1 to 1, 
3 to 3 and 4 to 4. The product of (1,3,4) and (2,5) is interpreted as a composition, with 
the right factor (2,5) applied first, as with composition of functions. In this case, the 
cycles are disjoint, so it makes no difference which mapping is applied first. 

The above analysis illustrates the fact that any permutation can be expressed as a 
product of disjoint cycles, and the cycle decomposition is unique. 


1.2.2 Definitions and Comments 


A permutation 7 is said to be even if its cycle decomposition contains an even number 
of even cycles (that is, cycles of even length); otherwise 7 is odd. A cycle can be de- 
composed further into a product of (not necessarily disjoint) two-element cycles, called 
transpositions. For example, 


(1, 2,3,4,5) = (1,5)(1, 4)(1, 3)(1, 2) 


where the order of application of the mappings is from right to left. 
Multiplication by a transposition changes the parity of a permutation (from even to 
odd, or vice versa). For example, 


(2,4) (1, 2,3, 4,5) = (2,3)(1,4, 5) 
(2,6) (1, 2,3, 4,5) = (1,6, 2,3, 4,5); 


(1,2,3,4,5) has no cycles of even length, so is even; (2,3)(1,4,5) and (1,6,2,3,4,5) each 
have one cycle of even length, so are odd. 

Since a cycle of even length can be expressed as the product of an odd number of 
transpositions, we can build an even permutation using an even number of transpositions, 
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and an odd permutation requires an odd number of transpositions. A decomposition into 
transpositions is not unique; for example, (1,2,3,4,5) = (1, 4)(1,5)(1, 4)(1,3)(1, 2)(3, 5), 
but as mentioned above, the cycle decomposition is unique. Since multiplication by a 
transposition changes the parity, it follows that if a permutation is expressed in two 
different ways as a product of transpositions, the number of transpositions will agree in 
parity (both even or both odd). 

Consequently, the product of two even permutations is even; the product of two odd 
permutations is even; and the product of an even and an odd permutation is odd. To 
summarize very compactly, define the sign of the permutation 7 as 


(7m) +1 if m7 is even 
sgn(7) = 
6 —1 if is odd 


Then for arbitrary permutations 7, and 72 we have 


sen(7172) = sgn(71) sgn(72). 


1.2.3 Definitions and Comments 


There are several permutation groups that are of major interest. The set S,, of all per- 
mutations of {1,2,...,n} is called the symmetric group on n letters, and its subgroup Ay, 
of all even permutations of {1,2,...,n} is called the alternating group on n letters. (The 
group operation is composition of functions.) Since there are as many even permutations 
as odd ones (any transposition, when applied to the members of S,,, produces a one-to-one 
correspondence between even and odd permutations), it follows that A,, is half the size 
of S,,. Denoting the size of the set S by |S|, we have 


|Snl=nt, |An| = 9n! 


We now define and discuss informally Dz,, the dihedral group of order 2n. Consider 
a regular polygon with center O and vertices Vj, V2,..., Vn, arranged so that as we move 
counterclockwise around the figure, we encounter V,, V3,... in turn. To eliminate some of 
the abstraction, let’s work with a regular pentagon with vertices A, B,C, D, E, as shown 


Ot 
se 


Figure 1.2.1 


The group Do consists of the symmetries of the pentagon, i.e., those permutations 
that can be realized via a rigid motion (a combination of rotations and reflections). All 
symmetries can be generated by two basic operations R and F: 
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R is counterclockwise rotation by 360 = see = 72 degrees, 

F (“flip”) is reflection about the line joining the center O to the first vertex (A in this 
case). 

The group Dz, contains 2n elements, namely, J (the identity), R,R?,...,R"~+, F, 
RF, R?F, ..., R"-'F (RF means F followed by R). For example, in the case of the 
pentagon, F = (B, E)(C,D) and R = (A,B,C,D,E), so RF = (A, B)(C, E), which is 
the reflection about the line joining O to D; note that RF can also be expressed as FR™!. 
In visualizing the effect of a permutation such as F’, interpret F’s taking B to E as vertex 
B moving to where vertex E' was previously. 

Do», will contain exactly n rotations J, R,...,R”~! and n reflections F, RF,...,R°~'F. 
If n is odd, each reflection is determined by a line joining the center to a vertex (and pass- 
ing through the midpoint of the opposite side). If n is even, half the reflections are 
determined by a line passing through two vertices (as well as the center), and the other 
half by a line passing through the midpoints of two opposite sides (as well as the center). 


1.2.4 An Abstract Characterization of the Dihedral Group 


Consider the free group with generators R and F, in other words all finite sequences whose 
components are R,R~',F and F~!. The group operation is concatenation, subject to 
the constraint that if a symbol and its inverse occur consecutively, they may be can- 
celled. For example, RFF FF-!RFR-'!RFF is identified with RFF RF FF, also written 
as RF?RF°?. If we add further restrictions (so the group is no longer “free”), we can 
obtain D2,. Specifically, Do, is the group defined by generators R and F,, subject to the 
relations 


R°=I, F*=I1, RF=FR. 


The relations guarantee that there are only 2n distinct group elements J, R,...,R"~+ and 
F,RF,...,R"~'F. For example, with n = 5 we have 


F?R?F = FFRRF = FFRFR"' = FFFR'R 1 = FR? = FR’; 


also, R cannot be the same as R?F, since this would imply that J = RF, or F = R7' = 
R‘, and there is no way to get this using the relations. Since the product of two group 
elements is completely determined by the defining relations, it follows that there cannot 
be more than one group with the given generators and relations. (This statement is true 
“up to isomorphism” ; it is always possible to create lots of isomorphic copies of any given 
group.) The symmetries of the regular n-gon provide a concrete realization. 

Later we will look at more systematic methods of analyzing groups defined by gener- 
ators and relations. 


Problems For Section 1.2 
1. Find the cycle decomposition of the permutation 


12 3 4 5 6 
4 6 3 1 2 5 


and determine whether the permutation is even or odd. 
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2. Consider the dihedral group Dg as a group of permutations of the square. Assume that 
as we move counterclockwise around the square, we encounter the vertices A, B,C, D 
in turn. List all the elements of Dg. 


3. In Ss, how many 5-cycles are there; that is, how many permutations are there with 
the same cycle structure as (1,2,3,4,5)? 


4. In $5, how many permutations are products of two disjoint transpositions, such as 
(1, 2)(3,4)? 
5. Show that if n > 3, then S;, is not abelian. 


6. Show that the products of two disjoint transpositions in S4, together with the identity, 
form an abelian subgroup V of $4. Describe the multiplication table of V (known as 
the four group). 

7. Show that the cycle structure of the inverse of a permutation 7 coincides with that 
of 7. In particular, the inverse of an even permutation is even (and the inverse of an 
odd permutation is odd), so that A, is actually a group. 

8. Find the number of 3-cycles, i.e., permutations consisting of exactly one cycle of 
length 3, in S4. 

9. Suppose H is a subgroup of A, with the property that for every permutation 7 in Ag, 
7? belongs to H. Show that H contains all 3-cycles in Ay. (Since 3-cycles are even, 
H in fact contains all 3-cycles in S4.) 


fl 2 3 4 °5 
OSD Ae Bl? 
Count the number of inversions of 7, that is, the number of pairs of integers that are 


out of their natural order in the second row of 7. For example, 2 and 5 are in natural 
order, but 4 and 3 are not. Compare your result with the parity of 7. 


10. Consider the permutation 


11. Show that the parity of any permutation 7 is the same as the parity of the number 
of inversions of 7. 


1.3. Cosets, Normal Subgroups, and Homomorphisms 
1.3.1 Definitions and Comments 
Let H be a subgroup of the group G. If g € G, the right coset of H generated by g is 
Hg = {hg: he FH}; 
similarly, the left coset of H generated by g is 
gH ={gh: he H}. 
It follows (Problem 1) that if a,b € G, then 


Ha=Hb ifandonlyif abt'¢H 
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and 
aH =bH if andonly if a-'be H. 


Thus if we define a and b to be equivalent iff ab~' € H, we have an equivalence relation 
(Problem 2), and (Problem 3) the equivalence class of a is 


{b: ab-+ € H} = Ha. 


Therefore the right cosets partition G (similarly for the left cosets). Since h — ha, 
h € H, is a one-to-one correspondence, each coset has |H| elements. There are as many 
right cosets as left cosets, since the map aH — Ha! is a one-to-one correspondence 
(Problem 4). If [|G : H], the index of H in G, denotes the number of right (or left) cosets, 
we have the following basic result. 


1.3.2 Lagrange’s Theorem 


If H is a subgroup of G, then |G| = |H|[G : H]. In particular, if G is finite then |H| 
divides |G], and 
IG] _ 


lay ee 


Proof. There are [G : H] cosets, each with |H| members. & 


1.3.3. Corollary 


Let G be a finite group. 

(i) If a € G then |a| divides |G|; in particular, al@! = 1. Thus |G| is a multiple of the 
order of each of its elements, so if we define the exponent of G to be the least common 
multiple of {|a|: a € G}, then |G| is a multiple of the exponent. 

(ii) If G has prime order, then G is cyclic. 


Proof. If the element a € G has order n, then H = {1,a,a”,...,a"~+} is a cyclic subgroup 
of G with |H| = n. By Lagrange’s theorem, n divides |G|, proving (i). If |G| is prime 
then we may take a 4 1, and consequently n = |G|. Thus H is a subgroup with as many 
elements as G, so in fact H and G coincide, proving (ii). & 


Here is another corollary. 


1.3.4 Euler’s Theorem 


If a and n are relatively prime positive integers, with n > 2, then a?) =1 mod n. A 
special case is Fermat’s Little Theorem: If p is a prime and a is a positive integer not 
divisible by p, then a?~! =1 mod p. 


Proof. The group of units mod n has order y(n), and the result follows from (1.3.3). & 


We will often use the notation H < G to indicate that H is a subgroup of G. If H is 
a proper subgroup, i.e., H < G but H #G, we write H < G. 
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1.3.5 The Index is Multiplicative 
If kK <H<G, then |G: kK] =|G: H][H: K). 


Proof. Choose representatives a; from each left coset of H in G, and representatives b; 
from each left coset of K in H. If cK is any left coset of kK in G, then c € a;H for some 
unique 7, and if c = ajh,h € A, then h € b;K for some unique J, so that c belongs to 
a,b; K. The map (a;,6;) — a,b; is therefore onto, and it is one-to-one by the uniqueness 
of i and 7. We therefore have a bijection between a set of size [G : H][H : K] and a set 
of size [G: K], as asserted. df 


Now suppose that H and K are subgroups of G, and define HK to be the set of all 
products hk,h € H,k © K. Note that HK need not be a group, since hykyh2k2 is not 
necessarily equal to hy hgk kg. If G is abelian, then HK will be a group, and we have the 
following useful generalization of this observation. 


1.3.6 Proposition 


If H<Gand K <G, then HK < G if and only if Hk = KH. In this case, HK is the 
subgroup generated by HU Kk. 


Proof. If HK is asubgroup, then (H K)~1, the collection of all inverses of elements of HK, 
must coincide with HK. But (HK)~' = K~'H-! = KH. Conversely, if HK = KH, 
then the inverse of an element in HK also belongs to HK, because (HK)~! = K~!H~1+ = 
KH = HK. The product of two elements in HK belongs to HK, because (HK)(HK) = 
AKHK = HHKK = HK. The last statement follows from the observation that any 
subgroup containing H and kK must contain HK. d& 


The set product HK defined above suggests a multiplication operation on cosets. If 
HA is a subgroup of G, we can multiply aH and bH, and it is natural to hope that we get 
abH. This does not always happen, but here is one possible criterion. 


1.3.7 Lemma 


If H < G, then (aH)(bH) = abH for all a,b € G iff cHc~! = H for all c € G. (Equiva- 
lently, cH = Hc for all cE G. 


Proof. If the second condition is satisfied, then (aH)(bH) = a(Hb)H = abHH = abH. 
Conversely, if the first condition holds, then cHc~! C cHc7'H since 1 € H, and 
(cH)(c-'H) = cc-!H(= H) by hypothesis. Thus cHc~! C H, which implies that 
H C c'He. Since this holds for all c € G, we have H C cHc™!, and the result fol- 
lows. & 


Notice that we have proved that if cHc~! C H for all c € G, then in fact cHc~! = H 
for all cE G. 
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1.3.8 Definition 


Let H be a subgroup of G. If any of the following equivalent conditions holds, we say 
that H is a normal subgroup of G, or that H is normal in G: 


Every left coset of H in G is also a right coset. 


Every right coset of H in G is also a left coset. 


We have established the equivalence of (1), (2) and (3) above, and (3) immediately 
implies (4). To show that (4) implies (3), suppose that cH = Hd. Then since c belongs 
to both cH and Hc, i.e., to both Hd and Hc, we must have Hd = Hc because right 
cosets partition G, so that any two right cosets must be either disjoint or identical. The 
equivalence of (5) is proved by a symmetrical argument. 

Notation: H <j G indicates that H is a normal subgroup of G; if H is a proper 
normal subgroup, we write H <G. 


1.3.9 Definition of the Quotient Group 


If H is normal in G, we may define a group multiplication on cosets, as follows. If aH 
and bH are (left) cosets, let 


(aH)(bH) = abH; 


by (1.3.7), (aH)(bH) is simply the set product. If a1 is another member of aH and by, 
another member of bH, then a;H = aH and b,H = bH (Problem 5). Therefore the set 
product of a;H and 6H is also abH. The point is that the product of two cosets does 
not depend on which representatives we select. 

To verify that cosets form a group under the above multiplication, we consider the 
four defining requirements. 


Closure: The product of two cosets is a coset. 
Associativity: This follows because multiplication in G is associative. 
Identity: The coset 1H = H serves as the identity. 
Inverse: The inverse of aH is a~'H. 
The group of cosets of a normal subgroup N of G is called the quotient group of G 
by N; it is denoted by G/N. 


Since the identity in G/N is 1N = N, we have, intuitively, “set everything in N equal 
to 1”. 
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1.3.10 Example 


Let GL(n,R) be the set of all nonsingular n by n matrices with real coefficients, and 
let SZ(n, R) be the subgroup formed by matrices whose determinant is 1 (GL stands for 
“general linear” and SL for “special linear”). Then S'L(n, R) «GL(n, R), because if A is 
a nonsingular n by n matrix and B is n by n with determinant 1, then det(ABA~!) = 
det A det B det A~' = det B = 1. 


1.3.11 Definition 


If f: G— H, where G and H are groups, then f is said to be a homomorphism if for all 
a,b in G, we have 


fab) = f(a) f(). 


This idea will look familiar if G and H are abelian, in which case, using additive notation, 
we write 


f(a+ 6) = f(a) + F(6); 


thus a linear transformation on a vector space is, in particular, a homomorphism on the 
underlying abelian group. If f is a homomorphism from G to H, it must map the identity 
of G to the identity of H, since f(a) = f(alc) = f(a)f(1c); multiply by f(a)~* to get 
ly = f(1c). Furthermore, the inverse of f(a) is f(a~+), because 


so that [f(a)]~' = f(a~*). 


1.3.12 The Connection Between Homomorphisms and Normal 
Subgroups 


If f: G — H is a homomorphism, define the kernel of f as 
ker f = {ae G: f(a) =1}; 


then ker f is a normal subgroup of G. For if a € G and b € ker f, we must show that 
aba~' belongs to ker f. But f(aba~!) = f(a) f(b) f(a~+) = f(a)(1)f(a)-! = 1. 

Conversely, every normal subgroup is the kernel of a homomorphism. To see this, 
suppose that N < G, and let H be the quotient group G/N. Define the map 7: G > G/N 
by (a) = aN; 7 is called the natural or canonical map. Since 


m(ab) = abN = (aN)(bN) = z(a)r(0), 


m is a homomorphism. The kernel of 7 is the set of all a € G such that aN = N(=1N), 
or equivalently, a € N. Thus kera = N. 
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1.3.13 Proposition 


A homomorphism f is injective if and only if its kernel K is trivial, that is, consists only 
of the identity. 


Proof. If f is injective and a € K, then f(a) = 1 = f(1), hence a = 1. Conversely, if K is 
trivial and f(a) = f(b), then f(ab~*) = f(a) f(0-*) = fl@[fO)]-* = f@[F(@]~* = 1, 
so ab-! € K. Thus ab-! = 1, i.e., a=, proving f injective. & 


1.3.14 Some Standard Terminology 

A monomorphism is an injective homomorphism 

An epimorphism is a surjective homomorphism 

An isomorphism is a bijective homomorphism 

An endomorphism is a homomorphism of a group to itself 
An automorphism is an isomorphism of a group with itself 


We close the section with a result that is applied frequently. 


1.3.15 Proposition 


Let f: G— H be a homomorphism. 

(i) If K is a subgroup of G, then f(A) is a subgroup of H. If f is an epimorphism 
and i is normal, then f() is also normal. 

(ii) If K is a subgroup of H, then f~!(K) is a subgroup of G. If K is normal, so is 
f-"(K). 


Proof. (i) If f(a) and f(b) belong to f(K), so does f(a) f(b)~+, since this element coin- 
cides with f(ab~'). If K is normal and c € G, we have f(c)f(K)f(c)~+ = f(cKke"') = 
f(K), so if f is surjective, then f(A) is normal. 

(ii) If a and b belong to f~!(K), so does ab~', because f(ab~') = f(a) f(b)~+, which 
belongs to K. If c € Ganda € f~'(K) then f(cac') = f(c)f(a)f(o)~, so if K is 
normal, we have cac~' € f~1(K), proving f~!(K) normal. d& 


Problems For Section 1.3 


In Problems 1-6, H is a subgroup of the group G, and a and 0 are elements of G. 


1. Show that Ha = Hb iff ab"! € H. 
2. Show that “a ~ b iff ab~! € H” defines an equivalence relation. 


3. If we define a and b to be equivalent iff ab~! € H, show that the equivalence class 
of ais Ha. 


4. Show that aH — Ha~! is a one-to-one correspondence between left and right cosets 
of H. 


14 


12. 


13. 


14. 
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. If a is a left coset of H in G and a, € aH, show that the left coset of H generated 


by ay (ie., a1), is also aH. 


. If [G: H] = 2, show that H is a normal subgroup of G. 
. Let $3 be the group of all permutations of {1,2,3}, and take a to be permutation 


(1,2,3), 6 the permutation (1,2), and e the identity permutation. Show that the 
elements of $3 are, explicitly, e, a, a7, b, ab and a?b. 


. Let H be the subgroup of $3 consisting of the identity e and the permutation b = (1, 2). 


Compute the left cosets and the right cosets of H in S3. 


. Continuing Problem 8, show that H is not a normal subgroup of S3. 


. Let f be an endomorphism of the integers Z. Show that f is completely determined 


by its action on 1. If f(1) =r, then f is multiplication by r; in other words, f(n) = rn 
for every integer n. 


. If f is an automorphism of Z, and I is the identity function on Z, show that f is 


either J or —J. 


Since the composition of two automorphisms is an automorphism, and the inverse of 
an automorphism is an automorphism, it follows that the set of automorphisms of a 
group is a group under composition. In view of Problem 11, give a simple description 
of the group of automorphisms of Z. 


Let H and K be subgroups of the group G. If x,y € G, define x ~ y iff x can be 
written as hyk for some h € H and k € K. Show that ~ is an equivalence relation. 


The equivalence class of  € Gis HaK = {hak: h € H,k € K}, called a double coset 
associated with the subgroups H and Kk. Thus the double cosets partition G. Show 
that any double coset can be written as a union of right cosets of H, or equally well 
as a union of left cosets of K. 


1.4 The Isomorphism Theorems 


Suppose that N is a normal subgroup of G, f is a homomorphism from G to H, and 7 is 
the natural map from G to G/N, as pictured in Figure 1.4.1. 


Figure 1.4.1 


We would like to find a homomorphism f: G/N — H that makes the diagram com- 


mutative. Commutativity means that we get the same result by traveling directly from G 
to H via f as we do by taking the roundabout route via 7 followed by f. This requirement 
translates to f(aN) = f(a). Here is the key result for finding such an f. 
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1.4.1 Factor Theorem 


Any homomorphism f whose kernel K contains N can be factored through G/N. In other 
words, in Figure 1.4.1 there is a unique homomorphism f: G/N — H such that fom = f. 
Furthermore: 

(i) f is an epimorphism if and only if f is an epimorphism; 

(ii) f is a monomorphism if and only if K = N; 


(iii) f is an isomorphism if and only if f is an epimorphism and K = N. 


Proof. If the diagram is to commute, then f(aN) must be f(a), and it follows that f, if 
it exists, is unique. The definition of f that we have just given makes sense, because if 
aN = dN, then a~'b€ N C K, so f(a~*b) = 1, and therefore f(a) = f(b). Since 


f(aNbN) = f(abN) = fad) = f(a) f(b) = f(aN) FON), 


f is a homomorphism. By construction, f has the same image as f, proving (i). Now the 
kernel of f is 


{aN: f(a) =1}={aN: ae K}=K/N. 


By (1.3.13), a homomorphism is injective, i-e., a monomorphism, if and only if its kernel 
is trivial. Thus f is a monomorphism if and only if K/N consists only of the identity 
element N. This means that if a is any element of K, then the coset aN coincides with 
N, which forces a to belong to N. Thus f is a monomorphism if and only if K = N, 
proving (ii). Finally, (iii) follows immediately from (i) and (ii). & 


The factor theorem yields a fundamental result. 


1.4.2 First Isomorphism Theorem 


If f: G — A is a homomorphism with kernel K, then the image of f is isomorphic 
to G/K. 


Proof. Apply the factor theorem with N = K, and note that f must be an epimorphism 
of G onto its image. & 


If we are studying a subgroup K of a group G, or perhaps the quotient group G/K, 
we might try to construct a homomorphism f whose kernel is K and whose image H has 
desirable properties. The first isomorphism theorem then gives G/K & H (where ©& is 
our symbol for isomorphism). If we know something about H, we may get some insight 
into K and G/K. 

We will prove several other isomorphism theorems after the following preliminary 


result. 
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1.4.3 Lemma 
Let H and N be subgroups of G, with N normal in G. Then: 


(i) HN = NH, and therefore by (1.3.6), HN is a subgroup of G. 
(ii) N is a normal subgroup of HN. 
(iii) HON is a normal subgroup of H. 


Proof. (i) We have hN = Nh for every h € G, in particular for every h € H. 
(ii) Since N is normal in G, it must be normal in the subgroup HN. 
(iii) HN is the kernel of the canonical map 7: G — G/N, restricted to H. & 


The subgroups we are discussing are related by a “parallelogram” or “diamond”, as 
Figure 1.4.2 suggests. 


HN 
A x we N 
HON 
Figure 1.4.2 


1.4.4 Second Isomorphism Theorem 
If H and N are subgroups of G, with N normal in G, then 

H/(HAN)~HNIN. 
Note that we write HN/N rather than H/N, since N need not be a subgroup of H. 
Proof. Let be the canonical epimorphism from G to G/N, and let 7 be the restriction 
of x to H. Then the kernel of 79 is HNN, so by the first isomorphism theorem, H/(HNN) 


is isomorphic to the image of 7, which is {hN: h € H} = HN/N. (To justify the last 
equality, note that for any n € N we havehnN=hN). & 


1.4.5 Third Isomorphism Theorem 


If N and H are normal subgroups of G, with N contained in H, then 
G/H = (G/N)/(H/N), 


a “cancellation law”. 
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Proof. This will follow directly from the first isomorphism theorem if we can find an 
epimorphism of G/N onto G/H with kernel H/N, and there is a natural candidate: 
f(aN) = aH. To check that f is well-defined, note that if aN =bN then a~!be N CH, 
so aH = bH. Since a is an arbitrary element of G, f is surjective, and by definition of 
coset multiplication, f is a homomorphism. But the kernel of f is 


{aN: aH =H}={aN:ac H}=H/N. & 


Now suppose that N is a normal subgroup of G. If H is a subgroup of G containing 
N, there is a natural analog of H in the quotient group G/N, namely, the subgroup H/N. 
In fact we can make this correspondence very precise. Let 


(A) = H/N 


be a map from the set of subgroups of G containing N to the set of subgroups of G/N. 
We claim that ~ is a bijection. For if H,/N = H2/N then for any h, € H,, we have 
hiN = h2N for some hg € Ho, so that hy hi € N, which is contained in Hz. Thus 
A, C Ho, and by symmetry the reverse inclusion holds, so that H, = Hp» and w is 
injective. Now if Q is a subgroup of G/N and 7: G — G/N is canonical, then 


na *(Q)={a€G: aN € Q}, 
a subgroup of G containing N, and 


v(m" (Q)) = {aN: aN € Q} =Q, 


proving ~ surjective. 
The map w has a number of other interesting properties, summarized in the following 
result, sometimes referred to as the fourth isomorphism theorem. 


1.4.6 Correspondence Theorem 


If N is a normal subgroup of G, then the map w: H — H/N sets up a one-to-one 
correspondence between subgroups of G containing N and subgroups of G/N. The inverse 
of ~ is the map tT: Q — 7~1(Q), where 7 is the canonical epimorphism of G onto G/N. 
Furthermore: 
(i) AH, < He if and only if H,/N < H2/N, and, in this case, 
[H>: Ay) = [H2/N: H,/N). 


(ii) H is a normal subgroup of G if and only if H/N is a normal subgroup of G/N. 
More generally, 


(iii) Hy is a normal subgroup of Hy if and only if H,/N is a normal subgroup of H2/N, 
and in this case, H)/H, = (H2/N)/H,/N). 
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Proof. We have established that w is a bijection with inverse 7. If H, < Hz, we have 
H,/N < H2/N immediately, and the converse follows from the above proof that w is 
injective. To prove the last statement of (i), let 7 map the left coset aHi,a € Ho, to the 
left coset (aN)(H,/N). Then 7 is a well-defined injective map because 


aH, =bH, iff a be My 
iff (aN)~!(bN) =a” ‘dN € H,/N 
iff (aN)(H,/N) = (0N)(H1/N); 


n is surjective because a ranges over all of He. 
To prove (ii), assume that H < G; then for any a € G we have 


(aN)(H/N)(aN)~' = (aHa~')/N = H/N 


so that H/N < G/N. Conversely, suppose that H/N is normal in G/N. Consider the 
homomorphism a — (aN)(H/N), the composition of the canonical map of G onto G/N 
and the canonical map of G/N onto (G/N)/(H/N). The element a will belong to the 
kernel of this map if and only if (aN)(H/N) = H/N, which happens if and only if 
aN € H/N, that is, aN = hN for some h € H. But since N is contained in H, this 
statement is equivalent to a € H. Thus H is the kernel of a homomorphism, and is 
therefore a normal subgroup of G. 

Finally, the proof of (ii) also establishes the first part of (iii); just replace H by Hy 
and G by Hz. The second part of (iii) follows from the third isomorphism theorem (with 
the same replacement). d& 


We conclude the section with a useful technical result. 


1.4.7 Proposition 


If H is a subgroup of G and N is a normal subgroup of G, we know by (1.4.3) that HN, 
the subgroup generated by H UN, is a subgroup of G. If H is also a normal subgroup 
of G, then HN is normal in G as well. More generally, if for each 7 in the index set I, we 
have H; < G, then (H;,i € I), the subgroup generated by the H; (technically, by the set 
UierH;) is a normal subgroup of G. 


Proof. A typical element in the subgroup generated by the H; is a = a,a2---ay, where az 
belongs to H;,. If g € G then 


g(aia2*++dn)g~* = (gaig™')(ga2g~*) + (gang) 
and ga,g_' € H;, because H;, <G. Thus gag~' belongs to (Hj,i€ I). & 


Problems For Section 1.4 


1. Let Z be the integers, and nZ the set of integer multiples of n. Show that Z/nZ 
is isomorphic to Z,, the additive group of integers modulo n. (This is not quite a 
tautology if we view Z,, concretely as the set {0,1,...,2—1}, with sums and differences 
reduced modulo n.) 
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2. If m divides n then Z, < Zn; for example, we can identify Z, with the subgroup 
{0, 3, 6,9} of Zia. Show that Zn /Zm = Znjm. 


3. Let a be an element of the group G, and let fg: G— G be “conjugation by a”, that 
is, fa(v) = ava~',x € G. Show that f, is an automorphism of G. 


4. An inner automorphism of G is an automorphism of the form f, (defined in Prob- 
lem 3) for some a € G. Show that the inner automorphisms of G form a group under 
composition of functions (a subgroup of the group of all automorphisms of G). 


5. Let Z(G) be the center of G, that is, the set of all 2 in G such that wy = yz for all y 
in G. Thus Z(G) is the set of elements that commute with everything in G. Show that 
Z(G) is a normal subgroup of G, and that the group of inner automorphisms of G is 
isomorphic to G/Z(G). 

6. If f is an automorphism of Z,,, show that f is multiplication by m for some m relatively 
prime to n. Conclude that the group of automorphisms of Z,, can be identified with 
the group of units mod n. 


7. The diamond diagram associated with the second isomorphism theorem (1.4.4) illus- 
trates least upper bounds and greatest lower bounds in a lattice. Verify that HN is the 
smallest subgroup of G containing both H and N, and HNN is the largest subgroup 
of G contained in both H and N. 


8. Let g be an automorphism of the group G, and f, an inner automorphism, as defined 
in Problems 3 and 4. Show that go f,og~/ is an inner automorphism. Thus the group 
of inner automorphisms of G is a normal subgroup of the group of all automorphisms. 

9. Identify a large class of groups for which the only inner automorphism is the identity 
mapping. 


1.5 Direct Products 


1.5.1 External and Internal Direct Products 


In this section we examine a popular construction. Starting with a given collection of 
groups, we build a new group with the aid of the cartesian product. Let’s start with 
two given groups H and K, and let G = H x K, the set of all ordered pairs (h,k), 
he H,k © K. We define multiplication on G componentwise: 


(hi, ky) (ho, ko) = (hike, kiko). 


Since (hih2,kik2) belongs to G, it follows that G is closed under multiplication. The 
multiplication operation is associative because the individual products on H and K are 
associative. The identity element in G is (1y,1x), and the inverse of (h,k) is (h~1,k7'). 
Thus G is a group, called the external direct product of H and Kk. 

We may regard H and K as subgroups of G. More precisely, G contains isomorphic 
copies of H and K, namely 


H = {(h,lx): h € H} and K = {(1y,k): ke K}. 
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Furthermore, H and K are normal subgroups of G. (Note that (h,k)(hi,1«)(h7',k7") = 
(hhih~+,1x«), with hhih~! € H.) Also, from the definitions of H and K, we have 


G=H K and HK = {1}, where 1 = (1y, 1x). 


If a group G contains normal subgroups H and K such that G= HK and HNK = {1}, 
we say that G is the internal direct product of H and K. 

Notice the key difference between external and internal direct products. We construct 
the external direct product from the component groups H and Kk. On the other hand, 
starting with a given group we discover subgroups H and K such that G is the inter- 
nal direct product of H and Kk. Having said this, we must admit that in practice the 
distinction tends to be blurred, because of the following result. 


1.5.2 Proposition 


If G is the internal direct product of H and K, then G is isomorphic to the external direct 
product H x K. 


Proof. Define f: H x K — G by f(h,k) = hk; we will show that f is an isomorphism. 
First note that if h € H and k € K then hk = kh. (Consider hkh~'k~1, which belongs 
to K since hkh~! € K, and also belongs to H since kh~'k~! € H; thus hkh~'k-1 = 1, 
so hk = kh.) 

(a) f is a homomorphism, since 


f((h1, k1) (he, k2)) = f(hihea, kike) = hihokike = (hiki)(heke) = f(hi, k1) f (ha, ke). 


(b) f is surjective, since by definition of internal direct product, G= HK. 

(c) f is injective, for if f(h,k) = 1 then hk = 1, so that h = k~'.Thus h belongs 
to both H and K, so by definition of internal direct product, h is the identity, and 
consequently so is k. The kernel of f is therefore trivial. d& 


External and internal direct products may be defined for any number of factors. We 
will restrict ourselves to a finite number of component groups, but the generalization to 
arbitrary cartesian products with componentwise multiplication is straightforward. 


1.5.3 Definitions and Comments 


If Hy, H2,...H, are arbitrary groups, the external direct product of the H; is the cartesian 
product G = Hy, x Hz x--- x Hy, with componentwise multiplication: 


(hi, ha, aaey hn)( - hd, Sao hi.) > (hihi, h2hy, ris hyhi,)3 
G contains an isomorphic copy of each H;, namely 
H; = AGliehs 24s hare eilijy becanety Glens Ne hi € H;}. 


As in the case of two factors, G = H,H2---H,, and H; < G for all i; furthermore, if 
g € G then g has a unique representation of the form 


g=hi ho-+-Rn where hy € Hi. 
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Specifically, g = (hi,...,hn) = (hi,1,...,1)...(1,...,l,hn). The representation is 
unique because the only way to produce the i-th component h; of g is for h; to be the i** 
component of the factor from H;. If a group G contains normal subgroups H,,..., Hn 
such that G = H,---H,, and each g € G can be uniquely represented as h,--- hy, with 
h; € H;,i =1,2,...,n, we say that G is the internal direct product of the H;. As in the 
case of two factors, if G is the internal direct product of the H;, then G is isomorphic 
to the external direct product H; x --- x Hy; the isomorphism f: Hy x --- x Hn —- G 
is given by f(hi,...,hn) = hi--+hy. The next result frequently allows us to recognize 
when a group is an internal direct product. 


1.5.4 Proposition 


Suppose that G = H,---H,, where each H; is a normal subgroup of G. The following 
conditions are equivalent: 


(1) G is the internal direct product of the Hj. 
(2) Hi (14:45 = {1} for ¢=1,...,n; thus it does not matter in which order the H; 
are listed. 


CUR) ay = for tS Tp at 


Proof. (1) implies (2): If g belongs to the product of the H;,7 #7, then g can be written 
as h,---h, where h; = 1 and h,; € H; for 7 #7. But if g also belongs to H; then g can be 
written as ky ---k, where kj = g and kj = 1 for 7 #7. By uniqueness of representation in 
the internal direct product, hj = k; = 1 for all 7, so g = 1. 

(2) implies (3): If g belongs to H; and, in addition, g = hi ---hj-1 with h; € H;, then 
g = hy ++ hail, °*> 1a, hence g = 1 by (2): 

(3) implies (1): If g € G then since G = H,--- H, we have g = hy --- hy with h; € Hj. 
Suppose that we have another representation g = k,---k, with k; € H;. Let i be the 
largest integer such that h; # k;. If 4 < n we can cancel the hi(= k;),t > i, to get 
hy-++hy = ky--- ky. lft = n then hy---hy = ky ---k; by assumption. Now any product 
of the H; is a subgroup of G (as in (1.5.2), hjh; = hjh; for i ~ j, and the result follows 
from (1.3.6)). Therefore 


i-1 


hick & [| 4: 


j=l 
and since hik; * € H;, we have hik; + = 1 by (3). Therefore h; = k;, which is a contra- 
diction. & 


Problems For Section 1.5 


In Problems 1-5, C;,, is a cyclic group of order n, for example, C, = {1,a,...,a”~!} with 
a” = 1. 


1. Let C2 be a cyclic group of order 2. Describe the multiplication table of the direct 
product C2 x C2. Is Cg x C2 cyclic? 
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2. Show that C2 x C2 is isomorphic to the four group (Section 1.2, Problem 6). 
3. Show that the direct product C2 x C3 is cyclic, in particular, it is isomorphic to C6. 


4. If n and m are relatively prime, show that C, x Ci, is isomorphic to Cyn, and is 
therefore cyclic. 


5. If n and m are not relatively prime, show that C,, x C,, is not cyclic. 

6. If p and q are distinct primes and |G| = p,|H| = q, show that the direct product G x H 
is cyclic. 

7. If H and K are arbitrary groups, show that H x kK =k x H. 


8. If G, H and K are arbitrary groups, show that G x (H x K) = (Gx H) x K. In fact, 
both sides are isomorphic to G x H x K. 


Chapter - 2 


> Ring fundamentals 


page 1 of Chapter 2 
CHAPTER 2 RING FUNDAMENTALS 


2.1 Basic Definitions and Properties 


2.1.1 Definitions and Comments A ring R is an abelian group with a multiplication 
operation (a,b) — ab that is associative and satisfies the distributive laws: a(b+c) = ab+ac 
and (a+ b)c = ab+ ac for all a,b,c € R. We will always assume that R has at least two 
elements, including a multiplicative identity 1p satisfying aly = lra = a for all a in R. 
The multiplicative identity is often written simply as 1, and the additive identity as 0. If 
a,b, and c are arbitrary elements of R, the following properties are derived quickly from the 
definition of a ring; we sketch the technique in each case. 


(1) a0=0a=0 [a0+ a0 =a(0 +0) = a0; 0a+ 0a = (0+ O)a = 0a] 
(2) (—a)b = a(—b) = —(ab) [0 = 0b = (a+(—a))b = ab+(—a)b, so (—a)b = —(ab); similarly, 


0 = a0 = a(b+ (—6)) = ab + a(—b),s0 a(—b) = —(ab)| 
(3) (-1)(-1)=1 [take a= 1,b = —1 in (2)] 
(4) (—a)(—b) =ab [replace b by —b in (2)] 
(5) a(b—c)=ab—ac_ [a(b+(-c)) = ab+ a(—c) = ab + (—(ac)) = ab — ac| 
(6) (a—b)e=ac—be [(a+(—b))c = ac + (—b)c) = ac — (be) = ac — be] 
(7) 1 


#0 [If 1 =0 then for all a we have a = al = a0 = 0, so R = {0}, contradicting the 
assumption that R has at least two elements] 


(8) The multiplicative identity is unique [If 1’ is another multiplicative identity then 
ty 1) 


2.1.2 Definitions and Comments If a and 6 are nonzero but ab = 0, we say that a and 
b are zero divisors; if a € R and for some b € R we have ab = ba = 1, we say that ais a unit 
or that a is invertible. 

Note that ab need not equal ba; if this holds for all a,b € R, we say that R is a commutative 
ring. 

An integral domain is a commutative ring with no zero divisors. 

A division ring or skew field is a ring in which every nonzero element a has a multiplicative 
inverse a! (ie., aa~t = a~'a = 1). Thus the nonzero elements form a group under 
multiplication. 

A field is a commutative division ring. Intuitively, in a ring we can do addition, subtraction 
and multiplication without leaving the set, while in a field (or skew field) we can do division 
as well. 


Any finite integral domain is a field. To see this, observe that if a 4 0, the map 
x — ax,x € R, is injective because R is an integral domain. If R is finite, the map is 
surjective as well, so that az = 1 for some z. 


The characteristic of a ring R (written Char R) is the smallest positive integer such 
that nl = 0, where nl is an abbreviation for 1+ 1+ ---1 (n times). If nl is never 0, we 
say that R has characteristic 0. Note that the characteristic can never be 1, since 1p 4 0. 
If R is an integral domain and Char R ¥ 0, then Char R must be a prime number. For if 
CharR =n =rs where r and s are positive integers greater than 1, then (r1)(s1) = nl = 0, 
so either r1 or s1 is 0, contradicting the minimality of n. 


A subring of a ring R is a subset S of R that forms a ring under the operations of 
addition and multiplication defined on R. In other words, S is an additive subgroup of 
R that contains 1p and is closed under multiplication. Note that lr is automatically the 
multiplicative identity of S, since the multiplicative identity is unique (see (8) of (2.1.1)). 
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2.1.3 Examples 
1. The integers Z form an integral domain that is not a field. 


2. Let Z,, be the integers modulo n, that is, Z, = {0,1,...,n—1} with addition os multipli- 
cation mod n. (Ifa € Z, then a is identified with all inteests a+kn,k =0,+1, ..}. Thus, 
for example, in Zg the multiplication of 3 by 4 results in 3 since 12 = 3 mod 9, a thévefore 
12 is identified with 3. 


Zn is a ring, which is an integral domain (and therefore a field, since Z,, is finite) if and 
only if n is prime. For ifn = rs then rs = 0 in Z,,; if n is prime then every nonzero element 
in Z,, has a multiplicative inverse, by Fermat’s little theorem 1.3.4. 

Note that by definition of characteristic, any field of prime characteristic p contains an 
isomorphic copy of Z,. Any field of characteristic 0 contains a copy of Z , hence a copy of 
the rationals Q. 


3. If n > 2, then the set M,(R) of all n by n matrices with coefficients in a ring R forms a 
noncommutative ring, with the identity matrix J, as multiplicative identity. If we identify 
the element c € R with the diagonal matrix cI,,, we may regard R as a subring of M,,(R). 
It is possible for the product of two nonzero matrices to be zero, so that M,,(R) is not an 
integral domain. (To generate a large class of examples, let E;; be the matrix with 1 in row 
i, column j, and 0’s elsewhere. Then Ey; Ex. = 6;,Eiu, where 6;, is 1 when j = k, and 0 
otherwise. ) 


4. Let 1,2,7 and k be basis vectors in 4-dimensional Euclidean space, and define multipli- 
cation of these vectors by 


P=pP=k=-1, ij=k, jk=i, ki=j, ji=-ij, kj =—jk, ik=—ki (1) 


Let H be the set of all linear combinations a + bi + cj + dk where a,b,c and d are real 
numbers. Elements of H are added componentwise and multiplied according to the above 
rules, i.e., 


(a+ bt + cj + dk)(x + yt + 27 + wk) = (ax — by — cz — dw) + (ay + ba + cw — dz)i 


+(az+ cx + dy — bw)j + (aw + dx + bz — cy)k. 


H (after Hamilton) is called the ring of quaternions. In fact H is a division ring; the inverse 
of a+ bi+cj + dk is (a? +b? +c? + d*?)~'(a — bi — cj — dk). 


H can also be represented by 2 by 2 matrices with complex entries, with multiplication 
of quaternions corresponding to ordinary matrix multiplication. To see this, let 


oO) 3.1. 07-3 RO 2A ay ote si 
r=[o S]- t= [o Spa [4 a) *=[F ol) 


a direct computation shows that 1,i,j and k obey the multiplication rules (1) given above. 
Thus we may identify the quaternion a+ bi + cj + dk with the matrix 


sale | at bi Ae) 


-c+tdi a-—bi 


(where in the matrix, i is /—1, not the quaternion ¢). 


The set of 8 elements +1, +7, +7, +k forms a group under multiplication; it is called 
the 
quaternion group. 
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5. If Ris a ring, then R[X], the set of all polynomials in X with coefficients in R, is also a 
ring under ordinary polynomial addition and multiplication, as is R[X),..., Xn], the set of 
polynomials in n variables X;,1 <i <n, with coefficients in R. Formally, the polynomial 
A(X) = a9 + a,X +---+a4,X” is simply the sequence (ao,...,@,); the symbol X is a 
placeholder. The product of two polynomials A(X) and B(X) is a polynomial whose X*- 
coefficient is agby + a,bp-1 +--+ + axbo. If we wish to evaluate a polynomial on R, we use 
the evaluation map 


dp +ayX +++: +anX”" Sag tayx+-:++anx” 


where x is a particular element of R. A nonzero polynomial can evaluate to 0 at all points of 
R. For example, X?+X evaluates to 0 on Zs, the field of integers modulo 2, since 1+1=0 
mod 2. We will say more about evaluation maps in Section 2.5, when we study polynomial 
rings. 


6. If Ris aring, then R[[X]], the set of formal power series 


with coefficients in R, is also a ring under ordinary addition and multiplication of power 
series. The definition of multiplication is purely formal and convergence is never mentioned; 
we simply define the coefficient of X” in the product of ag + a,X + agX? +--+ and by + 
by Xx + by X? +--+ to be agbn + aybn—1 tose An—104 + Anbo.- 


In Examples 5 and 6, if R is an integral domain, so are R[X] and R[[X]]. In Example 5, 
look at leading coefficients to show that if f(X) #0 and g(X) £0, then f(X)g(X) 40. In 
Example 6, if f(X)g(X) =0 with f(X) 4 0, let a; be the first nonzero coefficient of f(X). 
Then a,b; = 0 for all j, and therefore g(X) = 0. 


2.1.4 Lemma The generalized associative law holds for multiplication in a ring. There is 
also a generalized distributive law: 


(a1 +++++@m)(b1 +++ + bn) = S> S- aids. 


i=1 j=l 


Proof. The argument for the generalized associative law is exactly the same as for groups; 
see the beginning of Section 1.1. The generalized distributive law is proved in two stages. 
First set m = 1 and work by induction on n, using the left distributive law a(b +c) = 
ab + ac. Then use induction on m and the right distributive law (a + b)c = ac + be on 
(ay ++++ +m + Gm4i)(b1 +++: + bn). & 


2.1.5 Proposition The Binomial Theorem (a+ b)” = yy <9 (;)a*b"-* is valid in any 
ring, if ab = ba. 


Proof. The standard proof via elementary combinatorial analysis works. Specifically, 
(a+b)” =(a+b)---(a+ 5), and we can expand this product by multiplying an element (a 
or b) from 

object 1 (the first (a + b)) times an element from object 2 times ---times an element from 
object n, in all possible ways. Since ab = ba, these terms are of the form a*b"-*,0 <k <n. 
The number of terms corresponding to a given k is the number of ways of selecting k objects 
from a collection of n, namely (as & 


Problems For Section 2.1 


1. If R is a field, is R[X] a field always? sometimes? never? 
2. If Ris a field, what are the units of R[X]? 
3. Consider the ring of formal power series with rational coefficients. 
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(a) Give an example of a nonzero element that does not have a multiplicative inverse, and 
thus is not a unit. 

(b) Give an example of a nonconstant element (one that is not simply a rational number) 
that does have a multiplicative inverse, and therefore is a unit. 

4. Let Zi] be the ring of Gaussian integers a+ bi, where i = /—1 and a and b are integers. 
Show that Z[?] is an integral domain that is not a field. 

5. What are the units of Z[7]? 

6. Establish the following quaternion identities: 


(a) (Gan +yit+ zig + wk) (x9 — Yat — Zaj — wok) = (2122 + Y1Y2 + 2122+ WwW) 


+(—a1Yy2 + yite — 21W2 + W122)t + (—4129 + 2142 — Wiy2 + y1wWe2)j 


+(—@1 We + wite — yiz2 + Z1y2)k 


(b) (a9 + yot + Zo9 + wok) (a1 — Yt — Zj —- wk) = (2122 + Y1Y2 + 2122+ wwe) 


+(@1Yy2 — Yik2 + 21 We — W12Z2)t + (@12g — 2122 + WiY2 — YiWe)j 
+(@1W2 — wit, + y122 — 21Y2)k 


(c) The product of a quaternion h = a+ bi+cj+4+dk and its conjugate h* = a— bi —cj — dk 
is 
a? + b? + c? + d?. If q and t are quaternions, then (qt)* = t*q*. 


7. Use Problem 6 to establish Fuler’s Identity for real numbers &,, y,, Zr, Wr, 7 = 1,2: 


(at + yf + 27 + wi) (a5 + y8 + 25 + WS) = (wive + yiye + 2122 + wrwe)* 
+(21y2 — Yq + 212 — W122)? + (#122 — T2 + WiYy2 — YwW2)” 


+(x We — wire + y1z2 — z1y2)” 


8. Recall that an endomorphism of a group G is a homomorphism of G to itself. Thus if G 
is abelian, an endomorphism is a function f : G — G such that f(a+ 6) = f(a) + f(0) for 
all a,b € G. Define addition of endomorphisms in the natural way:(f + g)(a) = f(a) + g(a), 
and define multiplication as functional composition:(fg)(a) = f(g(a)). Show that the set 
End G of endomorphisms of G becomes a ring under these operations. 

9. What are the units of End G? 

10. It can be shown that every positive integer is the sum of 4 squares. A key step is to 
prove that if n and m can be expressed as sums of 4 squares, so can nm. Do this using 
Euler’s identity, and illustrate for the case n = 34,m = 54. 

11. Which of the following collections of n by n matrices form a ring under matrix addition 
and multiplication? 

(a) symmetric matrices 

(b) matrices whose entries are 0 except possibly in column 1 

(c) lower triangular matrices (a;; = 0 for i < j) 

(d) upper triangular matrices (a;; = 0 for i > j) 


2.2 Ideals, Homomorphisms, and Quotient Rings 


Let f: R— S, where R and S are rings. Rings are, in particular, abelian groups under 
addition, so we know what it means for f to be a group homomorphism: f (a+b) = f(a)+/f(b) 
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for all a, bin R. It is then automatic that f(0r) = 0g (see (1.3.11)). It is natural to consider 
mappings f that preserve multiplication as well as addition, i.e., 


f(a+b) = f(a) + f(b) and f(ab) = f(a) f(b) for all a,b € R. 


But here it does not follow that f maps the multiplicative identity 1p to the multiplicative 
identity 1s. We have f(a) = f(alr) = f(a)f(1R), but we cannot multiply on the left by 
f(a)~+, which might not exist. We avoid this difficulty by only considering functions f that 
have the desired behavior. 


2.2.1 Definition If f : R — S, where R and S are rings, we say that f is a ring homomor- 
phism if f(a +b) = f(a) + f(b) and f(ab) = f(a) f(b) for all a,b € R, and f(1R) =1s9. 


2.2.2 Example Let f : Z — M,(R), n > 2, be defined by f(n) = nE 1 (see (2.1.3), 
example 3). Then we have f(a+ 6) = f(a) + f(b), f(ab) = f(a) f(b), but f(1) 4 In. Thus f 
is not a ring homomorphism. 


In Chapter 1, we proved the basic isomorphism theorems for groups, and a key obser- 
vation was the connection between group homomorphisms and normal subgroups. We can 
prove similar theorems for rings, but first we must replace the normal subgroup by an object 
that depends on multiplication as well as addition. 


2.2.3 Definitions and Comments Let J be a subset of the ring R, and consider the 
following three properties: 


(1) I is an additive subgroup of R 
(2) Ifa@e Tt andr € R then ra € J, in other words, rI C I for every r € R 
(3) If@eT andr € R then ar € J, in other words, Ir C I for every r € R 


If (1) and (2) hold, J is said to be a left ideal of R. If (1) and (3) hold, I is said to be a right 
ideal of R. If all three properties are satisfied, I is said to be an ideal (or two-sided ideal) of 
R, a proper ideal if I # R, a nontrivial ideal if I is neither R nor {0}. 


If f: R > S is a ring homomorphism, its kernel is 
kerf = {re R: f(r) = 0}; 


exactly as in (1.3.13), f is injective if and only if kerf = {0}. 


Now it follows from the definition of ring homomorphism that kerf is an ideal of R. 
The kernel must be a proper ideal because if kerf = R then f is identically 0, in particular, 
f(1r) = 1s = 0g, a contradiction (see (7) of (2.1.1)). Conversely, every proper ideal is the 
kernel of a ring homomorphism, as we will see in the discussion to follow. 


2.2.4 Construction of Quotient Rings Let I be a proper ideal of the ring R. Since I is 
a subgroup of the additive group of R, we can form the quotient group R/I, consisting of 
cosets r+J,r € R. We define multiplication of cosets in the natural way: 


(r+ID(s+D=rst+I. 


To show that multiplication is well-defined, suppose that r+J=r'+Jands+I=<s'+T, 
so that r’ — r is an element of J, call it a, and s’ — s is an element of J, call it b. Thus 


r's' =(r+a)(s+b)=rs+ast+rb+ab, 
and since I is an ideal, we have as € I,rb € I, and ab € I. Consequently, r’s’ + T=rs+ 1, 


so the multiplication of two cosets is independent of the particular representatives r and s 
that we choose. 
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From our previous discussion of quotient groups, we know that the cosets of the ideal I 
form a group under addition, and the group is abelian because R itself is an abelian group 
under addition. Since multiplication of cosets r+ I and s+ J is accomplished simply by 
multiplying the coset representatives r and s in R and then forming the coset rs+J, we can 
use the ring properties of R to show that the cosets of I form a ring, called the quotient ring 
of R by I. The identity element of the quotient ring is 1p+J, and the zero element is Og+/T. 
Furthermore, if R is a commutative ring, so is R/I. The fact that I is proper is used in 
verifying that R/I has at least two elements. For if ly +J =0r+/T, then lp = 1le—Or € J; 
thus for any r € R we have r=rlpr € I, so that R = IJ, a contradiction. 


2.2.5 Proposition Every proper ideal J is the kernel of a ring homomorphism. 


Proof. Define the natural or canonical map 7: R— R/I by x(r) =r+T. We already know 
that a is a homomorphism of abelian groups and its kernel is I (see (1.3.12)). To verify that 
m@ preserves multiplication, note that 


(rs) =rs+I=(r+I)\(s+J) = x(r)r(s); 


since 
mpg) =1lr+l=1ry1, 


m is a ring homomorphism. & 


2.2.6 Proposition If f : R — S is a ring homomorphism and the only ideals of R are 
{0} and R, then f is injective. (In particular, if R is a division ring, then R satisfies this 
hypothesis. ) 


Proof. Let I = kerf, an ideal of R (see (2.2.3)). If J = R then f is identically zero, and is 
therefore not a legal ring homomorphism since f(1r) = 1g 4 0s. Thus J = {0}, so that f 
is injective. 

If RF is a division ring, then in fact R has no nontrivial left or right ideals. For suppose 
that I is a left ideal of R anda € J,a #0. Since R is a division ring, there is an element 
b € Rsuch that ba = 1, and since I is a left ideal, we have 1 € J, which implies that J = R. 
If I is a right ideal, we choose the element 6 such that ab = 1. & 


2.2.7 Definitions and Comments If X is a nonempty subset of the ring R, then (X) will 
denote the ideal generated by X, that is, the smallest ideal of R that contains X.Explicitly, 


(X) = RXR =the collection of all finite sums of the form S- r,2;8; with r;,5; € Randa; € x] 


To show that this is correct, verify that the finite sums of the given type form an ideal 
containing X. On the other hand, if J is any ideal containing X, then all finite sums 
3, rivis; must belong to J. 

If R is commutative, then ras = rsx, and we may as well drop the s. In other words: 


In a commutative ring, (X) = RX = all finite sums ye r3@;, 7, © Rix; € X. 


uv 


An ideal generated by a single element a is called a principal ideal and is denoted by < a > 
or (a). In this case, X = {a}, and therefore: 


In a commutative ring , the principal ideal generated by ais <a >= {ra:r€ R}, 


the set of all multiples of a, sometimes denoted by Ra. 


2.2.8 Definitions and Comments In an arbitrary ring, we will sometimes need to 
consider the sum of two ideals I and J, defined as {1 + y: x € I,y € J}. It follows from 
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the distributive laws that J + J is also an ideal. Similarly, the sum of two left [resp. right] 
ideals is a left [resp. right] ideal. 


Problems For Section 2.2 


1. What are the ideals in the ring of integers? 

2. Let M,(R) be the ring of n by n matrices with coefficients in the ring R. If C; is the 
subset of M,,() consisting of matrices that are 0 except perhaps in column k, show that 
C;, is a left ideal of M,,(R). Similarly, if R, consists of matrices that are 0 except perhaps 
in row k, then R, is a right ideal of M,,(R). 

3. In Problem 2, assume that F is a division ring, and let E;; be the matrix with 1 in row 
i, column 7, and 0’s elsewhere. 

(a) If A € M,,(R), show that E;;A has row j of A as its i”row, with 0’s elsewhere. 

(b) Now suppose that A € Cy. Show that E,;A has aj, in the ik position, with 0’s elsewhere, 
so that if aj, is not zero, then axp Bij A = Fiz. 

(c) If A is a nonzero matrix in Cy with aj, 4 0, and C is any matrix in C;,, show that 


S- CRO zg Hig A = C. 


i=l 


4, Continuing Problem 3, if a nonzero matrix A in Cz belongs to the left ideal I of M,,(R), 
show that every matrix in C, belongs to J. Similarly, if a nonzero matrix A in R; belongs 
to the right ideal I of M,,(R), every matrix in R, belongs to I. 

5. Show that if R is a division ring, then M,,(R) has no nontrivial two-sided ideals. 

6. In R[X], express the set I of polynomials with no constant term as < f > for an 
appropriate f, and thus show that J is a principal ideal. 

7. Let R be a commutative ring whose only proper ideals are {0} and R. Show that Risa 
field. 

8. Let R be the ring Z, of integers modulo n, where n may be prime or composite. Show 
that every ideal of R is principal. 


2.3 The Isomorphism Theorems For Rings 


The basic ring isomorphism theorems may be proved by adapting the arguments used 
in Section 1.4 to prove the analogous theorems for groups. Suppose that J is an ideal of the 
ring R, f is a ring homomorphism from R to S with kernel K, and 7 is the natural map, as 
indicated in Figure 2.3.1. To avoid awkward analysis of special cases, let us make a blanket 
assumption that any time a quotient ring Ro/Io appears in the statement of a theorem, the 
ideal I is proper. 
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R/I 


Figure 2.3.1 
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2.3.1 Factor Theorem For Rings Any ring homomorphism whose kernel contains J can 
be factored through R/J. In other words, in Figure 2.3.1 there is a unique ring homomor- 
phism f : R — S that makes the diagram commutative. Furthermore, 


(i) f is an epimorphism if and only if f is an epimorphism; 


(ii) f is a monomorphism if and only if kerf = J; 


(iii) f is an isomorphism if and only if f is an epimorphism and kerf = I. 


Proof. The only possible way to define f is f(a +J) = f(a). To verify that f is well-defined, 
note that ifa+I=b+T, thna—be€ICK, so f(a—b) =0,i-e.,f(a) = f(b). Since f is 
a ring homomorphism, so is f. To prove (i),(ii) and (iii), the discussion in (1.4.1) may be 
translated into additive notation and copied. & 


2.3.2 First Isomorphism Theorem For Rings If f : R — S is a ring homomorphism 
with kernel kK, then the image of f is isomorphic to R/K. 


Proof. Apply the factor theorem with J = K, and note that f is an epimorphism onto its 
image. & 


2.3.3 Second Isomorphism Theorem For Rings Let J be an ideal of the ring R, and 
let S be a subring of R. Then 


(a) S+ I(={a@+y:x€S,y € J}) is a subring of R; 

(b) I is an ideal of S + J; 

(c) SO J is an ideal of S; 

(d) (S +I)/T is isomorphic to S/(S J), as suggested by the “parallelogram” or “diamond” 
diagram in Figure 2.3.2. 
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SOI 


Figure 2.3.2 
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Proof. (a) Verify directly that S + I is an additive subgroup of R that contains 1 (since 
lr € S and Or € J) and is closed under multiplication. For example, ifa€ S, «EI, be 
S, y € I, then (a+ 2)(b+ y) = ab+ (ayt+ab4+ ay) Ee S+4+T1. 


(b) Since I is an ideal of R, it must be an ideal of the subring S + I. 
) 


( 


(d) Let 7: R > R/I be the natural map, and let 7 be the restriction of 7 to S. Then 7 is 
a ring homomorphism whose kernel is SM J and whose image is {a+JI:a€ S}=(S+J)/I. 
(To justify the last equality, note that if s € S and x € I we have (s+ 4) +1I=s+I/.) By 
the first isomorphism theorem for rings, S'/ker 79 is isomorphic to the image of 7g, and (d) 
follows. &e 


2.3.4 Third Isomorphism Theorem For Rings Let J and J be ideals of the ring R, 
with J C J. Then J/I is an ideal of R/I, and R/J = (R/I)/(J/D). 


Proof. Define f : R/I > R/J by f(a+I) =a+ J. To check that f is well-defined, suppose 
thata+I2=b6+4+T/. Thena-—beTlTCJ,soa+J=b+4+J. By definition of addition and 
multiplication of cosets in a quotient ring, f is a ring homomorphism. Now 


This follows from the definitions of subring and ideal. 


kerf ={a+T:a+J=Js={at+Il:aeJ}=J/I 


and 
imf ={a+J:a€ R}=R/J. 


(where im denotes image). The result now follows from the first isomorphism theorem for 
rings. de 

2.3.5 Correspondence Theorem For Rings If J is an ideal of the ring R, then the map 
S — S/T sets up a one-to-one correspondence between the set of all subrings of R containing 
I and the set of all subrings of R/I, as well as a one-to-one correspondence between the set 
of all ideals of R containing I and the set of all ideals of R/I. The inverse of the map is 
Q— 7~1(Q), where z is the canonical map: R > R/T. 


Proof. The correspondence theorem for groups yields a one-to-one correspondence between 
additive subgroups of R containing J and additive subgroups of R/I. We must check that 
subrings correspond to subrings and ideals to ideals. If S is a subring of R then S/T is 
closed under addition, subtraction and multiplication. For example, if s and s’ belong to 
S, we have (s+ I)(s'+ I) = ss’ +I € S/I. Since lp € S we have lp +I € S/TJ, proving 
that S/T is a subring of R/I. Conversely, if S/I is a subring of R/I, then S is closed under 
addition, subtraction and multiplication, and contains the identity, hence is a subring or R. 
For example, if s,s’ € S then (s+ I)(s’+ I) € S/I, so that ss’ + I =t+I for somet eS, 
and therefore ss’ —t € I. But IC S, so ss’ € S. 

Now if J is an ideal of R containing I, then J/I is an ideal of R/I by the third 
isomorphism theorem for rings. Conversely, let J/I be an ideal of R/I. Ifre Randae J 
then (r+J)(a+J) € J/T, that is, re+I € J/I. Thus for some j € J we have rx—j EI C J, 
so rx € J. A similar argument shows that «ar € J, and that J is an additive subgroup of R. 
It follows that J is an ideal of R. & 


We now consider the Chinese remainder theorem, which is an abstract version of a 
result in elementary number theory. Along the way, we will see a typical application of the 
first isomorphism theorem for rings, and in fact the development of any major theorem of 
algebra is likely to include an appeal to one or more of the isomorphism theorems. The 
following observations may make the ideas easier to visualize. 


2.3.6 Definitions and Comments 


(i) If a and b are integers that are congruent modulo n, then a — b is a multiple of n. Thus 
a — b belongs to the ideal J,, consisting of all multiples of n in the ring Z of integers. Thus 
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we may say that a is congruent to b modulo [,. In general, if a,b € R and I is an ideal of 
R, we say that a=bmod/ifa—beTl. 


(ii) The integers a and 0 are relatively prime if and only if the integer 1 can be expressed 
as a linear combination of a and b. Equivalently, the sum of the ideals J, and J, is the 
entire ring Z. In general, we say that the ideals J and J in the ring R are relatively prime 
ifl+J=R. 


(iii) If Ip, consists of all multiples of n; in the ring of integers (¢ = 1,...k), then the 
intersection N*_,In, is I,, where r is the least common multiple of the n;. If the n; are 
relatively prime in pairs, then r is the product of the n,. 


(iv) If Ry,...,R,_ are rings, the direct product of the R; is defined as the ring of n-tuples 
(a1,---,@n),a; € R;, with componentwise addition and multiplication, that is, 


(1,---,@n)+(b1,..-,bn) = (ai +b1,...,Gn+bn) and (a1,...,dn)(b1,..-, bn) = (arbi, ..-,anbn) ¥ 


The zero element is (0,...,0) and the multiplicative identity is (1,...,1). 


2.3.7 Chinese Remainder Theorem Let R be an arbitrary ring, and let [,,...,I, be 
ideals in R that are relatively prime in pairs, that is, J; + 1; = R for all i ¥ j. 


(1) If a; = 1 (the multiplicative identity of R) and a; = 0 (the zero element of R) for 
j = 2,...,n, then there is an element a € R such that a = a; mod J; for alli = 1,...,n. 
More generally, 


(2) If aj,...,a@, are arbitrary elements of R, there is an element a € R such that a = a; 
mod J; for alli =1,...,n. 


(3) If bis another element of R such that b = a; mod J; for alli =1,...,n, then b= a mod 
ILNIgN...AIn. Conversely, if b= a mod M_,J;, then b= a; mod J; for all 2. 
(4) R/();_, [i is isomorphic to the direct product []}_, R/Ji. 


Proof. 

1) If 7 > 1 we have I, + J; = R, so there exist elements 6; € J; and c; € J; such that 
J j ij j j 

b; + Ci = 1; thus 


=: 


(b; + C;) = 1; 


j=2 


Expand the left side and observe that any product containing at least one 6; belongs to J, 
while c2---c, belongs to ITj=2 I;, the collection of all finite sums of products x2--- x, with 
x; € I;. Thus we have elements b € J; and a € ITj=2 I, (a subset of each Ij) with b+a= 1. 
Consequently, a = 1 mod J; and a= 0 mod J; for 7 > 1, as desired. 


(2) By the argument of part (1), for each 7 we can find c with c; = 1 mod J; and c; = 0 
mod J;,7 #7. Ifa = aye, +--+ +4@,€p, then a has the desired properties. To see this, write 
a— a; = a— ae; + a;(c¢; — 1), and note that a — a,c; is the sum of the a;c;,7 # i, and is 
therefore congruent to 0 mod J;. 

(3) We have b = a; mod J; for all ¢ iff b— a = 0 mod J; for all i, that is, iffb-aEentyhi, 
and the result follows. 

(4) Define f : R > [J], R/I; by f(a) = (a+h,...,a4+1,). If a1,...,a, € R, then by 
part (2) there is an element a € R such that a = a; mod J; for all 7. But then f(a) = 


(a1 +hh,...,@n +I), proving that f is surjective. Since the kernel of f is the intersection 
of the ideals J;, the result follows from the first isomorphism theorem for rings. de 


The concrete version of the Chinese remainder theorem can be recovered from the 
abstract result; see Problems 3 and 4. 
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Problems For Section 2.3 


1. Show that the group isomorphisms of Section 1.4, Problems 1 and 2, are ring isomorphisms 
as well. 

2. Give an example of an ideal that is not a subring, and a subring that is not an ideal. 

3. If the integers m;,2 = 1,...,n, are relatively prime in pairs, and aj,...,@, are arbitrary 
integers, show that there is an integer a such that a = a; mod m, for all 7, and that any 
two such integers are congruent modulo m,--- Mn. 


4. If the integers m;,7 = 1,...,n, are relatively prime in pairs and m = m,---mn, show 
that there is a ring isomorphism between Z,, and the direct product |i Zm,. Specifically, 
a mod m corresponds to (a mod my,...,a mod m,). 


5. Suppose that R= R, x Ro is a direct product of rings. Let Ri be the ideal Ry x {0} = 
{(71,0) : 71 € Ri}, and let RS be the ideal {(0,r2) : ro € Ro}. Show that R/R, = Re and 
R/R, = Ry. 


If I,,...,I, are ideals, the product I,---I, is defined as the set of all finite sums 
Yo; G12 ++ Gni, Where ag; € Ip, k =1,...,n. [See the proof of part (1) of (2.3.7); check 
that the product of ideals is an ideal.] 


6. Assume that R is a commutative ring. Under the hypothesis of the Chinese remainder 
theorem, show that the intersection of the ideals J; coincides with their product. 

7. Let ,...,I, be ideals in the ring R. Suppose that R/M, I; is isomorphic to |]; R/I; via 
a+nit; > (at+h,...,a+J,). Show that the ideals I; are relatively prime in pairs. 


2.4 Maximal and Prime Ideals 


If J is an ideal of the ring R, we might ask“What is the smallest ideal containing [?” 
and “What is the largest ideal containing I?” Neither of these questions is challenging; the 
smallest ideal is I itself, and the largest ideal is R. But if J is a proper ideal and we ask for 
a maximal proper ideal containing J, the question is much more interesting. 


2.4.1 Definition A mazimal ideal in the ring R is a proper ideal that is not contained in 
any strictly larger proper ideal. 


2.4.2 Theorem Every proper ideal J of the ring R is contained in a maximal ideal. 
Consequently, every ring has at least one maximal ideal. 


Proof. The argument is a prototypical application of Zorn’s lemma. Consider the collection 
of all proper ideals containing I, partially ordered by inclusion. Every chain {J:,t € T} of 
proper ideals containing J has an upper bound, namely the union of the chain. (Note that 
the union is still a proper ideal, because the identity 1p belongs to none of the ideals J;.) 
By Zorn, there is a maximal element in the collection, that is, a maximal ideal containing 
I. Now take I = {0} to conclude that every ring has at least one maximal ideal. & 


We have the following characterization of maximal ideals. 


2.4.3 Theorem Let M be an ideal in the commutative ring R. Then M is a maximal ideal 
if and only if R/M is a field. 


Proof. Suppose M is maximal. We know that R/M is a ring (see (2.2.4)); we need to find 
the multiplicative inverse of the element a+ M of R/M, where a+ M is not the zero element, 
ie.,a ¢ M. Since M is maximal, the ideal Ra+ M, which contains a and is therefore strictly 
larger than M, must be the ring F itself. Consequently, the identity element 1 belongs to 
Ra+M. If1=ra+m where r € Rand me M, then 


(r+ M)(at+ M)=ra+M =(1—m)+M=1+M sinceme M, 


proving that r+ M is the multiplicative inverse of a+ M. 
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Conversely, if R/M is a field, then M must be a proper ideal. If not, then M = R, 
so that R/M contains only one element, contradicting the requirement that 14 0 in R/M 
(see (7) of (2.1.1)). By (2.2.6), the only ideals of R/M are {0} and R/M, so by the 
correspondence theorem (2.3.5), there are no ideals properly between M and R. Therefore 
M is a maximal ideal. & 


If in (2.4.3) we relax the requirement that R/M be a field, we can identify another class 
of ideals. 


2.4.4 Definition A prime ideal in a commutative ring R is a proper ideal P such that for 
any two elements a,b in R, 


ab € P implies that ac Porbe P. 


We can motivate the definition by looking at the ideal (p) in the ring of integers. In this 
case, a € (p) means that p divides a, so that (p) will be a prime ideal if and only if 


p divides ab implies that p divides a or p divides b, 


which is equivalent to the requirement that p be a prime number. 


2.4.5 Theorem If P is an ideal in the commutative ring R, then P is a prime ideal if and 
only if R/P is an integral domain. 


Proof. Suppose P is prime. Since P is a proper ideal, R/P is a ring. We must show that if 
(a+ P)(b+ P) is the zero element P in R/P, theona+ P= Porb+P=P,ie.,a€ Por 
b € P. This is precisely the definition of a prime ideal. 

Conversely, if R/P is an integral domain, then as in (2.4.3), P is a proper ideal. If ab € P, 
then (a+ P)(b+ P) is zero in R/P, so thata+ P= Porb+P=P,ie.,a€ Porbe P.& 


2.4.6 Corollary In a commutative ring, a maximal ideal is prime. 
Proof. This is immediate from (2.4.3) and (2.4.5). & 


2.4.7 Corollary Let f: R— S be an epimorphism of commutative rings. Then 
(i) If S is a field then ker f is a maximal ideal of R; 
(ii) If S is an integral domain then ker f is a prime ideal of R. 


Proof. By the first isomorphism theorem (2.3.2), S is isomorphic to R/ker f, and the result 
now follows from (2.4.3) and (2.4.5). & 


2.4.8 Example Let Z[X] be the set of all polynomials f(X) = ap +a, X +---+an)X",n = 
0,1,...in the indeterminate X, with integer coefficients. The ideal generated by X, that is, 
the collection of all multiples of X, is 


<X >= {f(X) € ZX] : ap =O}. 
The ideal generated by 2 is 
<2>={f(X) € Z[X]: all a; are even integers.} 


Both < X > and < 2 > are proper ideals, since 2 ¢< X > and X ¢< 2 >. In fact we can 
say much more; consider the ring homomorphisms y : Z[X] — Z and w: Z[X] — Ze given 
by y(f(X)) = ao and w(f(X)) = Go, where Gp is a9 reduced modulo 2. We will show that 
both < X > and < 2 > are prime ideals that are not maximal. 

First note that < X > is prime by (2.4.7), since it is the kernel of y. Then observe that 
< X > is not maximal because it is properly contained in < 2, X >, the ideal generated by 
2and X. 
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To verify that < 2 > is prime, note that it is the kernel of the homomorphism from 
Z[X] onto Z2[X] that takes f(X) to f(X), where the overbar indicates that the coefficients 
of f(X) are to be reduced modulo 2. Since Z2[X] is an integral domain (see the comment at 
the end of (2.1.3)), <2 > is a prime ideal. Since < 2 > is properly contained in < 2, X >, 
<2 > is not maximal. 

Finally, < 2, X > is a maximal ideal, since 


ker W = {ap + Xg(X) : ag is even and g(X) € Z[X]} =< 2,X >. 


Thus < 2,X > is the kernel of a homomorphism onto a field, and the result follows form 
(2.4.7). 


Problems For Section 2.4 


1. We know from Problem 1 of Section 2.2 that in the ring of integers, all ideals J are of 
the form < n > for some n € Z, and since n € I implies —n € I, we may take n to be 
nonnegative. Let <n > be a nontrivial ideal, so that n is a positive integer greater than 1. 
Show that <n > is a prime ideal if and only if n is a prime number. 

2. Let I be a nontrivial prime ideal in the ring of integers. Show that in fact J must be 
maximal. 

3. Let F'[X]] be the ring of formal power series with coefficients in the field F' (see (2.1.3), 
Example 6). Show that < X > is a maximal ideal. 

4. Perhaps the result of Problem 3 is a bit puzzling. Why can’t we argue that just as in 
(2.4.8), < X > is properly contained in < 2, X >, and therefore < X > is not maximal? 

5. Let I be a proper ideal of F[[X]]. Show that I C< X >, so that < X > is the unique 
maximal ideal of F[[X]]. (A commutative ring with a unique maximal ideal is called a local 
ring.) 

6. Show that every ideal of F'[[X]] is principal, and specifically every nonzero ideal is of the 
form (X") for some n = 0,1,.... 

7. Let f : R — S bearing homomorphism, with R and S commutative. If P is a prime 
ideal of S$, show that the preimage f~!(P) is a prime ideal of R. 

8. Show that the result of Problem 7 does not hold in general when P is a maximal ideal. 
9. Show that a prime ideal P cannot be the intersection of two strictly larger ideals I and 
J. 


2.5 Polynomial Rings 

In this section, all rings are assumed commutative. To see a good reason for this 
restriction, consider the evaluation map (also called the substitution map) E,, where x is a 
fixed element of the ring R. This map assigns to the polynomial ap + a1X +--:+an,X” in 
R[X] the value ap + ayv+---+a,2” in R. It is tempting to say that “obviously”, EF, is a 
ring homomorphism, but we must be careful. For example, 


E,[(a+bX)(c+dX)] = E,(ac + (ad + be) X + bdX?) = ac + (ad + be)x + bdx?, but 


E,(a+ bX)E,(c+ dX) = (a+ br)(c+ dx) = ac + adx + bac + badz, 


and these need not be equal if R is not commutative. 

If f and g are polynomials in R[X], where R is a field, ordinary long division allows 
us to express f as gg +r, where the degree of r is less than the degree of g. (The degree, 
abbreviated deg, of a polynomial ag + a1X +---+a,X™” (with leading coefficient a, # 0) 
is n; it is convenient to define the degree of the zero polynomial as —oo. We have a similar 
result over an arbitrary commutative ring, if g is monic, i.e., the leading coefficient of g is 
1. For example (with R = Z), we can divide 2X? + 10X27 +16X +10 by X74+3X +5: 


2X? +.10X? + 16X +10= 2X (X72 +3X +5) +4X? + 6X +10. 
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The remainder 4X?+6X +10 does not have degree less than 2, so we divide it by X?+3.X+5: 


4X? +6X +10 =4(X*+3X +5) —6X — 10. 
Combining the two calculations, we have 
2X° +10X7 +16X +10 = (2X + 4)(X? + 3X +5) + (-6X — 10) 


which is the desired decomposition. 


2.5.1 Division Algorithm If f and g are polynomials in R[X], with g monic, there are 
unique polynomials g and r in R[X] such that f = gg +r and deg r <deg g. If R is a field, 
g can be any nonzero polynomial. 


Proof. The above procedure, which works in any ring R, shows that q and r exist.If f = 
qgtr=qigtri where r and r; are of degree less than deg g, then g(¢—q1) = 11 —r. But if 
q—q # 0, then since g is monic, the degree of the left side is at least deg g, while the degree 
of the right side is less than deg g, a contradiction. Therefore g = q,, and consequently 
r=1T,. & 
2.5.2 Remainder Theorem If f € R[X] and a € R, then for some unique polynomial 
q(X) in R[X] we have 

F(X) = (X)(X — a) + f(a); 
hence f(a) = 0 if and only if X — a divides f(X). 
Proof. By the division algorithm, we may write f(X) = q(X)(X — a)+1r(X) where the 
degree of r is less than 1, i.e., r is a constant. Apply the evaluation homomorphism X — a 
to show that r= f(a). & 


2.5.3 Theorem If R is an integral domain, then a nonzero polynomial f in R[X] of degree 
n has at most n roots in R, counting multiplicity. 

Proof. If f(a1) = 0, then by (2.5.2), possibly applied several times, we have f(X) = 
qi(X)(X — ay)", where qi(a1) 4 0 and the degree of gq, is n— 14. If ag is another root of f, 
then 0 = f(a2) = qi(a2)(ag—a1)™. But a, 4 a2 and R is an integral domain, so q,(a2) must 
be 0, i.e. ag is a root of qi(X). Repeating the argument, we have qi(X) = q2(X)(X — a2)”, 
where go(a2) # 0 and deg qg = n—1n1 — ng. After n applications of (2.5.2), the quotient 
becomes constant, and we have f(X) = c(X — a,)™---(X — ag)" where c € R and 
ny +---+nz =n. Since R is an integral domain, the only possible roots of f are a1,..., Gx. 


& 


2.5.4 Example Let R = Zs, which is not an integral domain. The polynomial f(X) = X° 
has four roots in R, namely 0,2,4 and 6. 


Problems For Section 2.5 


In the following sequence of problems, we review the Euclidean algorithm. Let a and b 
be positive integers, with a > b. Divide a by 6b to obtain 


a=ba +71 withO<r, <b, 
then divide b by 71 to get 

b=r1q2+r2 withO<re <7, 
and continue in this fashion until the process terminates: 


ry =7oq3 +73, O< 73 < Yo, 
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ryj—-2 = 15-195 +75, OST] <Tj-1, 
Tj-1 = T5Qj41- 


1. Show that the greatest common divisor of a and 6 is the last remainder r;. 

2. If d is the greatest common divisor of a and b, show that there are integers x and y such 
that ax + by = d. 

3. Define three sequences by 


Tie = Ti-2 — Gi-1 
Ti = U2 — GXi-1 


Yi = Yi-2 — GYi-1 


for 2 = —1,0,1,... with initial conditions r_; = a, r9 = b, x-1 = 1, %) = 0, y-1 = 0, yo = 
1. (The gq; are determined by dividing r;-2 by r;_-1.) Show that we can generate all steps 
of the algorithm, and at each stage, r; = ax; + by;. 

4. Use the procedure of Problem 3 (or any other method) to find the greatest common 
divisor d of a = 123 and b = 54, and find integers x and y such that ax + by = d. 

5. Use Problem 2 to show that Z, is a field if and only if p is prime. 


If a(X) and b(X) are polynomials with coefficients in a field F’, the Euclidean algorithm 
can be used to find their greatest common divisor. The previous discussion can be taken 
over verbatim, except that instead of writing 


a=qot+r withO<r, <5, 


we write 


a(X) = qi(X)b(X) + r1(X) with deg r1(X) < deg b(X). 


The greatest common divisor can be defined as the monic polynomial of highest degree that 
divides both a(X) and b(X). 

6. Let f(X) and g(X) be polynomials in FLX], where F is a field. Show that the ideal I 
generated by f(X) and g(X), i.e., the set of all linear combinations a(X)f(X) + b(X)g(X), 
with a(X),b(X) € F[X], is the principal ideal J =< d(X) > generated by the greatest 
common divisor d(X) of f(X) and g(X). 


7. (Lagrange Interpolation Formula) Let do,@1,...,@, be distinct points in the field 
F, and define 
xX —= A 
POS lS BS acai 
7, a4 — Aj 
j#Fi 


then P;(a;) = 1 and P;(a;) = 0 for 7 #7. If bo, bi,...,b, are arbitrary elements of F' (not 
necessarily distinct), use the P; to find a polynomial f(X) of degree n or less such that 
f(a;) = 6; for all 7. 

8. In Problem 7, show that f(X) is the unique polynomial of degree n or less such that 
f(a:) = 6; for all z. 

9. Suppose that f is a polynomial in FX], where F is a field. If f(a) = 0 for every a € F, 
it does not in general follow that f is the zero polynomial. Give an example. 

10. Give an example of a field F' for which it does follow that f = 0. 


2.6 Unique Factorization 

If we are asked to find the greatest common divisor of two integers, say 72 and 60, 
one method is to express each integer as a product of primes; thus 72 = 2° x 37, 60 = 
2? «3x5. The greatest common divisor is the product of terms of the form p°, where for each 
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prime appearing in the factorization, we use the minimum exponent. Thus gcd(72,60) = 
2? x 3! x 5° = 12. (To find the least common multiple, we use the maximum exponent: 
Icm(72,60) = 2° x 3? x 5! = 360.) The key idea is that every integer (except 0,1 and -1) 
can be uniquely represented as a product of primes. It is natural to ask whether there are 
integral domains other than the integers in which unique factorization is possible. We now 
begin to study this question; throughout this section, all rings are assumed to be integral 
domains. 


2.6.1 Definitions Recall from (2.1.2) that a unit in a ring R is an element with a multi- 
plicative inverse. The elements a and 0 are associates if a = ub for some unit u. 

Let a be a nonzero nonunit; a is said to be irreducible if it cannot be represented as a 
product of nonunits. In other words, if a = bc, then either b or c must be a unit. 

Again let a be a nonzero nonunit; a is said to be prime if whenever a divides a product 
of terms, it must divide one of the factors. In other words, if a divides bc, then a divides b 
or a divides c (a divides b means that b = ar for some r € R). It follows from the definition 
that if p is any nonzero element of R, then p is prime if and only if < p> is a prime ideal. 

The units of Z are 1 and —1, and the irreducible and the prime elements coincide. But 
these properties are not the same in an arbitrary integral domain. 


2.6.2 Proposition If a is prime, then a is irreducible, but not conversely. 


Proof. We use the standard notation r|s to indicate that r divides s. Suppose that a is 
prime, and that a = bc. Then certainly albc, so by definition of prime, a|b or alc, say aJb. 
If b = ad then b = bcd, so cd = 1 and therefore c is a unit. (Note that 6 cannot be 0, for 
if so, a = bc = 0, which is not possible since a is prime.) Similarly, if alc with c = ad then 
c = bcd, so bd = 1 and bis a unit. Therefore a is irreducible. 

To give an example of an irreducible element that is not prime, consider R = Z[,/—3] = 
{a+ibV3 : a,b € Z}; in R, 2 is irreducible but not prime. To see this, first suppose that we 
have a factorization of the form 


2 = (a+ ibV3)(c + idv3); 
take complex conjugates to get 

2 = (a — ibV3)(c — idvV3). 
Now multiply these two equations to obtain 

4 = (a? + 3b7)(c? + 3d”). 


Each factor on the right must be a divisor of 4, and there is no way that a? + 3b? can be 
2. Thus one of the factors must be 4 and the other must be 1. If, say, a? + 3b? = 1, then 
a = +1 and b = 0. Thus in the original factorization of 2, one of the factors must be a 
unit, so 2 is irreducible. Finally, 2 divides the product (1 + iV/3)(1 — iV3) (= 4), so if 2 
were prime, it would divide one of the factors, which means that 2 divides 1, a contradiction 
since 1/2 is not an integer. & 

The distinction between irreducible and prime elements disappears in the presence of 
unique factorization. 


2.6.3 Definition A unique factorization domain (UFD) is an integral domain R satisfying 
the following properties: 


(UF1) Every nonzero element a in R can be expressed as @ = upi +--+ Pn, where wu is a unit 
and the p; are irreducible. 


(UF2): If a has another factorization, say a = vqi---Gm, where v is a unit and the gq; are 
irreducible, then n = m and, after reordering if necessary, p; and q; are associates for each 
U 
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Property UF 1 asserts the existence of a factorization into irreducibles, and U F2 asserts 
uniqueness. 


2.6.4 Proposition In a unique factorization domain, a is irreducible if and only if a is 
prime. 


Proof. By (2.6.2), prime implies irreducible, so assume a irreducible, and let a divide bc. 
Then we have ad = bc for some d € R. We factor d, 6 and c into irreducibles to obtain 


aud,» dy = vb, +++ bower C4 


where u,v and w are units and the d;,b; and c; are irreducible. By uniqueness of factoriza- 
tion, a, which is irreducible, must be an associate of some b; or c;. Thus a divides b or a 
divides c. & 


2.6.5 Definitions and Comments Let A be a nonempty subset of R, with 0 ¢ A. The 
element d is a greatest common divisor (gcd) of A if d divides each a in A, and whenever e 
divides each a in A, we have eld. 

If d’ is another gcd of A, we have d|d’ and d’|d, so that d and d’ are associates. We will 
allow ourselves to speak of “the” greatest common divisor, suppressing but not forgetting 
that the gcd is determined up to multiplication by a unit. 

The elements of A are said to be relatively prime (or the set A is said to be relatively 
prime) if 1 is a greatest common divisor of A. 

The nonzero element m is a least common multiple (lem) of A if each a in A divides m, 
and whenever ale for each a in A, we have mle. 

Greatest common divisors and least common multiples always exist for finite subsets 
of a UFD; they may be found by the technique discussed at the beginning of this section. 

We will often use the fact that for any a,b € R, we have alb if and only if << b >C<a>. 
This follows because a|b means that b = ac for some c € R. For short, divides means 
contains. 

It would be useful to be able to recognize when an integral domain is a UFD. The 
following criterion is quite abstract, but it will help us to generate some explicit examples. 


2.6.6 Theorem Let FR be an integral domain. Then: 

(1) If R is a UFD then R satisfies the ascending chain condition (acc) on principal ideals, 
in other words, if aj,a2,... belong to R and < ay >C< ag >C ..., then the sequence 
eventually stabilizes, that is, for some n we have < ay >=< Gn41 >=< Gnp2Q >=... 


(2) If R satisfies the ascending chain condition on principal ideals, then R satisfies UF1, 
that is, every nonzero element of R can be factored into irreducibles. 


(3) If R satisfies UF 1 and in addition, every irreducible element of R is prime, then R is a 
UFD. 


Thus R is a UFD if and only if R satisfies the ascending chain condition on principal ideals 
and every irreducible element of R is prime. 


Proof. 

(1) If < a1 >C< ag >C ... then aj41|a; for all i. Therefore the prime factors of aj+1 
consist of some (or all) of the prime factors of a;. Multiplicity is taken into account here; 
for example, if p® is a factor of a;, then p* will be a factor of a;41 for some k € {0,1,2,3}. 
Since a; has only finitely many prime factors, there will come a time when the prime factors 
are the same from that point on, that is, < a, >=< dn41 >=.... 


(2) Let a, be any nonzero element. If a; is irreducible, we are finished, so let a; = agb2 
where neither a2 nor b2 is a unit. If both a2 and bo are irreducible, we are finished, so 
we can assume that one of them, say ag, is not irreducible. Since ag divides a, we have 
<a, >C< ag >, and in fact the inclusion is proper because az ¢< a1 >. (If ag = ca, then 
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a1 = agb2 = caybe, so be is a unit, a contradiction.) Continuing, we have az = a3b3 where 
neither a3 nor b3 is a unit, and if, say, a3 is not irreducible, we find that < az >C< a3 >. 
If a; cannot be factored into irreducibles, we obtain, by an inductive argument, a strictly 
increasing chain < a, >C< ag >C.... of principal ideals. 


(3) Suppose that a = upipe-+:Pn = vqiga2:::Qm where the p; and q; are irreducible and 
u and v are units. Then p, is a prime divisor of vq,---Gm, So p; divides one of the qi, 
say qi. But q, is irreducible, and therefore p; and q, are associates. Thus we have, up 
to multiplication by units, po...pn = q2°::dm-. By an inductive argument, we must have 
m =n, and after reordering, p; and q; are associates for each i. & 


We now give a basic sufficient condition for an integral domain to be a UFD. 


2.6.7 Definition A principal ideal domain (PID) is an integral domain in which every 
ideal is principal, that is, generated by a single element. 


2.6.8 Theorem Every principal ideal domain is a unique factorization domain. For short, 
PID implies UFD. 


Proof. If< ay >C< ag >C..., let J =U; < a; >. Then J is an ideal, necessarily principal 
by hypothesis. If J =< b > then b belongs to some < a, >, so] C< ay, >. Thus ifi >n 
we have < aj >C I C< ayn >C< a; >. Therefore < a; >=< ay, > for alli > n, so that R 
satisfies the acc on principal ideals. 


Now suppose that a is irreducible. Then < a > is a proper ideal, for if <a >= R 
then 1 €< a >, so that a is a unit. By the acc on principal ideals, < a > is contained in 
a maximal ideal I. (Note that we need not appeal to the general result (2.4.2), which uses 
Zorn’s lemma.) If J =< b > then 6 divides the irreducible element a, and 6 is not a unit 
since I is proper. Thus a and 0 are associates, so < a >=< 6 >= I. But J, a maximal ideal, 
is prime by (2.4.6), hence a is prime. The result follows from (2.6.6). & 


The following result gives a criterion for a UFD to be a PID. (Terminology: the zero 
ideal is {0}; a nonzero ideal is one that is not {0}.) 


2.6.9 Theorem R is a PID iff R isa UFD and every nonzero prime ideal of R is maximal. 


Proof. Assume R is a PID; then R is a UFD by (2.6.8). If < p > is a nonzero prime ideal 
of R, then < p > is contained in the maximal ideal < q >, so that q divides the prime p. 
Since a maximal ideal must be proper, g cannot be a unit, so that p and q are associates. 
But then < p >=< q > and < p> is maximal. 

The proof of the converse is given in the exercises. & 


Problems For Section 2.6 


The problems in this section form a project designed to prove that if R is a UFD and 
every nonzero prime ideal of R is maximal, then R is a PID. 


1. Let I be an ideal of R; since {0} is principal, we can assume that I 4 {0}. Since Risa 
UFD, every nonzero element of J can be written as up,---p, where u is a unit and the p; 
are irreducible, hence prime. Let r = r(I) be the minimum such t. We are going to prove 
by induction on r that J is principal. 

If r = 0, show that [=< 1 >= R. 
2. If the result holds for all r < n, let r = n, with up,---pyn € I, hence pi---ppn € TI. 
Since p, is prime, < p; > is a prime ideal, necessarily maximal by hypothesis. By (2.4.3), 
R/ <p, >is a field. If b belongs to I but not to < p; >, show that for some c € R we have 
be-1E< pi >. 
3. By Problem 2, bc — dp; = 1 for some d € R. Show that this implies that p2---p, € I, 
which contradicts the minimality of n. Thus if b belongs to J, it must also belong to < p; >, 
that is, I C< p, >. 
4. Define J = {a € R: xp; € I}, and show that J is an ideal. 
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5. Show that Jp, = I. 

6. Since p++: py = (p2-+:Pn)pr € I, we have po--- pn € J. Use the induction hypothesis 
to conclude that J is principal. 

7. Let p and q be prime elements in the integral domain R, and let P =< p > and 
Q =< q > be the corresponding prime ideals. Show that it is not possible for P to be a 
proper subset of Q. 

8. If R is a UFD and P is a nonzero prime ideal of R, show that P contains a nonzero 
principal prime ideal. 


2.7 Principal Ideal Domains and Euclidean Domains 


In Section 2.6, we found that a principal ideal domain is a unique factorization domain, 
and this exhibits a class of rings in which unique factorization occurs. We now study some 
properties of PID’s, and show that any integral domain in which the Euclidean algorithm 
works is a PID. If J is an ideal in Z, in fact if J is simply an additive subgroup of Z, then I 
consists of all multiples of some positive integer n; see Section 1.1, Problem 6. Thus Z is a 
PID. 

Now suppose that A is a nonempty subset of the PID R. The ideal < A > generated 
by A consists of all finite sums }> r;a; with r; € R and a; € A; see (2.2.7). We show that if 
d is a greatest common divisor of A, then d generates A, and conversely. 


2.7.1 Proposition Let R be a PID, with A a nonempty subset of R. Then d is a greatest 
common divisor of A if and only if d is a generator of < A>. 


Proof. Let d be a ged of A, and assume that < A >=< b>. Then d divides every a € A, 
so d divides all finite sums )* r;a;, in particular d divides b, hence < b >C< d >, that is, 
<A>C<d>. But if a € Athen a €< b >, so that b divides a. Since d is a gcd of 
A, it follows that b divides d, so < d > is contained in < b >=< A >. We conclude that 
< A>=<d>, proving that d is a generator of < A>. 


Conversely, assume that d generates < A >. If a€ A then a is a multiple of d, so that 
dla. Since (by (2.2.7)) d can be expressed as }* r;a;, any element that divides everything in 
A divides d, so that dis a gcd of A. & 


2.7.2 Corollary If dis a gcd of A, where A is a nonempty subset of the PID R, then d 
can be expressed as a finite linear combination )* rj;a; of elements of A with coefficients in 
R. 


Proof. By (2.7.1), d€< A>, and the result follows from (2.2.7). & 


As a special case, we have the familiar result that the greatest common divisor of two 
integers a and b can be expressed as ax + by for some integers x and y. 


The Euclidean algorithm in Z is based on the division algorithm: if a and 6 are integers 
and b # 0, then a can be divided by 6 to produce a quotient and remainder. Specifically, 
we have a = bq +r for some g,r € Z with |r| < |b|.. The Euclidean algorithm performs 
equally well for polynomials with coefficients in a field; the absolute value of an integer is 
replaced by the degree of a polynomial. It is possible to isolate the key property that makes 
the Euclidean algorithm work. 


2.7.3 Definition Let R be an integral domain. R is said to be a Euclidean domain (ED) 
if there is a function UW from R \ {0} to the nonnegative integers satisfying the following 
property: 

If a and 6 are elements of R, with b 4 0, then a can be expressed as bq + r for some 
q,r © R, where either r = 0 or U(r) < U(d). 

We can replace“r = 0 or U(r) < U(b)” by simply “U(r) < U(b)” if we define U(0) to 
be —oo. 
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In any Euclidean domain, we may use the Euclidean algorithm to find the greatest 
common divisor of two elements; see the Problems in Section 2.5 for a discussion of the 
procedure in Z and in FX], where F is a field. 


A Euclidean domain is automatically a principal ideal domain, as we now prove. 


2.7.4 Theorem If FR is a Euclidean domain, then R is a principal ideal domain. For short, 
ED implies PID. 


Proof. Let I be an ideal of R. If J = {0} then J is principal, so assume I 4 {0}. Then 
{U(b) :b € I,b £0} is a nonempty set of nonnegative integers, and therefore has a smallest 
element n. Let b be any nonzero element of J such that Y(b) = n; we claim that I =< b>. 
For if a belongs to I then we have a = bg+r where r = 0 or U(r) < U(b). Nowr=a—bq El 
(because a and b belong to I), so if r 4 0 then U(r) < W(b) is impossible by minimality of 
U(b). Thus b is a generator of I. & 


The most familiar Euclidean domains are Z and F[X], with F a field. We now examine 
some less familiar cases. 


2.7.5 Example Let Z[Vd] be the ring of all elements a + bVd, where a,b € Z. If d = 
—2,—1,2 or 3, we claim that Z[Vd] is a Euclidean domain with 


U(a + bVd) = |a? — db? |. 


Since the a + bVd are real or complex numbers, there are no zero divisors, and Z[Vd] is an 
integral domain. Let a, 3 € Z[Vd], 6 4 0, and divide a by 6 to get e+ yVd. Unfortunately, 
x and y need not be integers, but at least they are rational numbers. We can find integers 
reasonably close to x and y; let xp and yo be integers such that |2 — zo| and |y — yo| are at 
most 1/2. Let 


q=2o+yovd, r= GB((x—20)+(y—yo)Vd); then Bg +r = B(x +yvVd) =a. 
We must show that U(r) < U(G). Now 
(a+ bV4d) = |(a + bVd)(a — bv d)|, 


and it follows (Problem 4) that for all 7,6 € Z[/d] we have 


(75) = U(q)¥ (4). 


(When d = —1, this says that the magnitude of the product of two complex numbers is the 
product of the magnitudes.) Thus U(r) = W(@)|(a — xo)? — d(y — yo)?], and the factor in 
brackets is at most ++ |d|(4) < ++ 2 = 1. The only possibility for equality occurs when 
d = 3 (d= —3 is excluded by hypothesis) and |x — xo| = |y — yo| = 3. But in this case, the 
factor in brackets is |+ — 3(4)| = 3 < 1. We have shown that U(r) < U(), so that Z[Vd] 
is a Euclidean domain. 


When d= —1, we obtain the Gaussian integers a+ bi, a,b€ Z, i= V-1. 
Problems For Section 2.7 


1. Let A = {aj,...,@n} be a finite subset of the PID R. Show that m is a least common 
multiple of A iff m is a generator of the ideal NP_, < a; >. 

2. Find the gcd of 11 + 32 and 8 — 7 in the ring of Gaussian integers. 

3. Suppose that R is a Euclidean domain in which U(a) < V(ab) for all nonzero elements 
a,b € R. Show that U(a) > V(1), with equality if and only if a is a unit in R. 

4. Let R = Z[Vd], where d is any integer, and define U(a + bVd) = |a? — db?|. Show 
that for all nonzero a and 3, U(aZ) = V(a)¥(%), and if d is not a perfect square, then 
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W(a) < Wap). 

5. Let R = Z[Vd] where d is not a perfect square. Show that 2 is not prime in R. (Show 
that 2 divides d? — d.) 

6. If dis a negative integer with d < —3, show that 2 is irreducible in Z[vd]. 

7. Let R = Z[Vd] where d is a negative integer. We know (see (2.7.5)) that R is an ED, 
hence a PID and a UFD, for d= —1 and d = —2. Show that for d < —3, R is not a UFD. 
8. Find the least common multiple of 11 + 32 and 8 —7 in the ring of Gaussian integers. 

9. If a=a-+ bi is a Gaussian integer, let V(a) = a? + b? as in (2.7.5). If U(a) is prime in 
Z, show that a is prime in Z/i]. 


2.8 Rings of Fractions 


It was recognized quite early in mathematical history that the integers have a signif- 
icant defect: the quotient of two integers need not be an integer. In such a situation a 
mathematician is likely to say“I want to be able to divide one integer by another, and I 
will”. This will be legal if the computation takes place in a field F' containing the integers 
Z. Any such field will do, since if a and b belong to F and b 4 0, then a/b € F. How do 
we know that a suitable F' exists? With hindsight we can take F' to be the rationals Q, the 
reals R, or the complex numbers C. In fact, Q is the smallest field containing Z, since any 
field F > Z contains a/b for all a,b € Z, b #0, and consequently F D Q. 

The same process that leads from the integers to the rationals allows us to construct, 
for an arbitrary integral domain D, a field F' whose elements are (essentially) fractions 
a/b, a,b € D,b #0. F is called the field of fractions or quotient field of D. The mathe- 
matician’s instinct to generalize then leads to the following question: If R is an arbitrary 
commutative ring (not necessarily an integral domain), can we still form fractions with nu- 
merator and denominator in R? Difficulties quickly arise; for example, how do we make 
sense of £5 when bd = 0? Some restriction must be placed on the allowable denominators, 
and we will describe a successful approach shortly. Our present interest is in the field of 
fractions of an integral domain, but later we will need the more general development. Since 
the ideas are very similar, we will give the general construction now. 


2.8.1 Definitions and Comments Let S be a subset of the ring R; we say that S' is 
multiplicative if 0 ¢ S, 1 € S, and whenever a and b belong to S, we have ab € S. We 
can merge the last two requirements by stating that S is closed under multiplication, if we 
regard 1 as the empty product. Here are some standard examples. 


(1) S = the set of all nonzero elements of an integral domain. 
(2) S = the set of all elements of a commutative ring R that are not zero divisors. 
(3) S = R\ P, where P is a prime ideal of the commutative ring R. 


If S is a multiplicative subset of the commutative ring R, we define the following 
equivalence relation on R x S: 


(a,b) ~ (c,d) iff for some s € S we have s(ad — bc) = 0. 


If we are constructing the field of fractions of an integral domain, then (a,b) is our first 
approximation to a/b. Also, since the elements s € S are never 0 and R has no zero divisors, 
we have (a,b) ~ (c,d) iff ad = bc, and this should certainly be equivalent to a/b = c/d. 

Let us check that we have a legal equivalence relation. [Commutativity of multiplication 
will be used many times to slide an element to a more desirable location in a formula. There 
is a theory of rings of fractions in the noncommutative case, but we will not need the results, 
and in view of the serious technical difficulties that arise, we will not discuss this area.] 

Reflexivity and symmetry follow directly from the definition. For transitivity, suppose 
that (a,b) ~ (c,d) and (c,d) ~ (e, f). Then for some elements s and t in S we have 


s(ad — bc) = 0 and t(cf — de) = 0. 


page 24 of Chapter 2 
Multiply the first equation by tf and the second by sb, and add the results to get 
std(af — be) = 0, 


which implies that (a,b) ~ (e, f), proving transitivity. 

Ifa ¢ Rand b € S, we define the fraction ¢ to be the equivalence class of the pair 
(a,b). The set of all equivalence classes is denoted by S~!R, and is called (in view of what 
we are about to prove) the ring of fractions of R by S. The term localization of R by S is 
also used, because it will turn out that in Examples (1) and (3) above, S~'R is a local ring 
(see Section 2.4, Problem 5). 

We now make the set of fractions into a ring in a natural way. 
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additive identity: 9 (= © for any s € S) 

additive inverse: —(¢) = 5S 

multiplicative identity: + (= * for any s € S) 

2.8.2 Theorem With the above definitions, S~'R is a commutative ring. If R is an integral 


domain, so is S~R. If R is an integral domain and S$ = R \ {0}, then S~'R is a field (the 
field of fractions or quotient field of R). 


Proof. First we show that addition is well-defined. If a1/b1 = c1/dy and a2/b2 = c2/de, then 
for some s,t € S' we have 


s(ayd, me b1c1) = 0 and t(agd2 = b2C2) =0 (1) 


Multiply the first equation of (1) by tbedg and the second equation by sb,d,, and add the 
results to get 
st|(azbe + azb1)d\d — (cid + c2d1)b1 ba] = 0. 


Thus 
a ,b2 + agb; = ci dg + cod, 


by ba : did,” 


in other words, 


so that addition of fractions does not depend on the particular representative of an equiva- 
lence class. 

Now we show that multiplication is well-defined. We follow the above computation as 
far as (1), but now we multiply the first equation by tagd2, the second by sc,b;, and add. 
The result is 

stlayaodi do = by b2c1¢9] =0 


which implies that 


as desired. We now know that the fractions in S~!R can be added and multiplied in exactly 
the same way as ratios of integers, so checking the defining properties of a commutative 
ring essentially amounts to checking that the rational numbers form a commutative ring; 
see Problems 3 and 4 for some examples. 

Now assume that R is an integral domain. It follows that if a/b is zero in S~'R, ice., 
a/b = 0/1, then a = 0 in R. (For some s € S we have sa = 0, and since R is an integral 
domain and s 4 0, we must have a = 0.) Thus if ¢5 = 0, then ac = 0, so either a or c is 0, 
and consequently either a/b or c/d is zero. Therefore S~1R is an integral domain. 

If R is an integral domain and S = R \ {0}, let a/b be a nonzero element of S~'R. 
Then both a and 6b are nonzero, so that a,b € S. By definition of multiplication we have 
ab a +. Thus a/b has a multiplicative inverse, so that S~1R is a field. & 


When we go from the integers to the rational numbers, we don’t lose the integers in the 
process, in other words, the rationals contain a copy of the integers, namely, the rationals 
of the form a/1, a € Z. So a natural question is whether S~!R contains a copy of R. 


2.8.3 Proposition Define f : R — S~'R by f(a) =a/1. Then f is a ring homomorphism. 
If S has no zero divisors then f is a monomorphism, and we say that R can be embedded in 
S-!R. In particular, 
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(i) A commutative ring R can be embedded in its complete (or full) ring of fractions (S~'R, 
where S consists of all non-divisors of zero in R). 
(ii) An integral domain can be embedded in its quotient field. 


Proof. We have f(a+b) = F®=¢+42= f(a) +f(0), flab) = 2% = $F = fla) fd), 
and f(1) = }, proving that f is a ring homomorphism. If $ has no zero divisors and 
f(a) =a/1 = 0/1, then for some s € S we have sa = 0, and since s cannot be a zero divisor, 
we have a = 0. Thus f is a monomorphism. & 


2.8.4 Corollary The quotient field F of an integral domain R is the smallest field containing 
R. 


Proof. By (2.8.3), we may regard R as a subset of F’, so that F is a field containing R. 
But if L is any field containing R, then all fractions a/b, a,b € R, must belong to L. Thus 
FCL.&@ 


Problems For Section 2.8 

1. If the integral domain D is in fact a field, what is the quotient field of D? 

2. If D is the set F(X] of all polynomials over a field F’', what is the quotient field of D? 

3. Give a detailed proof that addition in a ring of fractions is associative. 

4. Give a detailed proof that the distributive laws hold in a ring of fractions. 

5. Let R be an integral domain with quotient field F’, and let h be a ring monomorphism 
from R to a field L. Show that h has a unique extension to a monomorphism from F to L. 
6. Let h be the ring homomorphism from Z to Z,, p prime, given by h(x) = x mod p. Why 
can’t the analysis of Problem 5 be used to show that h extends to a monomorphism of the 
rationals to Z,? (This can’t possibly work since Z, is finite, but what goes wrong’) 

7. Let S be a multiplicative subset of the commutative ring R, with f : R — S~'R defined 
by f(a) =a/1. If g is a ring homomorphism from R to a commutative ring R’ and g(s) isa 
unit in R’ for each s € S, we wish to find a ring homomorphism 9 : S~!R — R’ such that 
a(f(a)) = g(a) for every a € R. Thus the diagram below is commutative. 


g 
R — R 
f 4 Sf 4 


S'R 
Show that there is only one conceivable way to define g. 


8. Show that the mapping you have defined in Problem 7 is a well-defined ring homomor- 
phism. 


2.9 Irreducible Polynomials 


2.9.1 Definitions and Comments In (2.6.1) we defined an irreducible element of a ring; 
it is a nonzero nonunit which cannot be represented as a product of nonunits. If R is an 
integral domain, we will refer to an irreducible element of R[X] as an irreducible polynomial. 
Now in F'[X], where F is a field, the units are simply the nonzero elements of F’ (Section 
2.1, Problem 2). Thus in this case, an irreducible element is a polynomial of degree at least 
1 that cannot be factored into two polynomials of lower degree. A polynomial that is not 
irreducible is said to be reducible or factorable. For example, X? +1, regarded as an element 
of R[X], where R is the field of real numbers, is irreducible, but if we replace R by the larger 
field C of complex numbers, X? + 1 is factorable as (X — i)(X +7), i= /—1. We say that 
X? +1 is irreducible over R but not irreducible over C. 


Now consider D[X], where D is a unique factorization domain but not necessarily a field, 
for example, D = Z. The polynomial 12X + 18 is not an irreducible element of Z[X] because 
it can be factored as the product of the two nonunits 6 and 2X + 3. It is convenient to 
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factor out the greatest common divisor of the coefficients (6 in this case). The result is a 
primitive polynomial, one whose content (gcd of coefficients) is 1. A primitive polynomial 
will be irreducible if and only if it cannot be factored into two polynomials of lower degree. 


In this section, we will compare irreducibility over a unique factorization domain D and 
irreducibility over the quotient field F of D. Here is the key result. 


2.9.2 Proposition Let D be a unique factorization domain with quotient field F’. Suppose 
that f is a nonzero polynomial in D[X] and that f can be factored as gh, where g and 
h belong to F[X]. Then there is a nonzero element A € F such that Ag € D[X] and 
\~th € D[X]. Thus if f is factorable over F, then it is factorable over D. Equivalently, if 
f is irreducible over D, then f is irreducible over F’. 


Proof. The coefficients of g and h are quotients of elements of D. If a is the least common 
denominator for g (technically, the least common multiple of the denominators of the coef- 
ficients of g), then g* = ag € D[X]. Similarly we have h* = bh € D[X]. Thus abf = g*h* 
with g*,h* € D[X] and c=abe D. 

Now if p is a prime factor of c, we will show that either p divides all coefficients of g* 
or p divides all coefficients of h*. We do this for all prime factors of c to get f = goho with 
go, ho € D[X]. Since going from g to go involves only multiplication or division by nonzero 
constants in D, we have go = Ag for some nonzero \ € F. But then hp = A~'h, as desired. 

Thus let 


9g (X)=g0. t+ 9X +---+95X%, h*(X) Hho HA X+--- + hy X*. 
Since p is a prime factor of c = ab and abf = g*h*,p must divide all coefficients of g*h*. If 


p does not divide every g; and p does not divide every h;, let g, and h, be the coefficients 
of minimum index not divisible by p. Then the coefficient of X“*” in g*h* is 


Goruty + Grhuto—1 tes + guly +++ + GQuto-1h1 + Gu+vho. 


But by choice of u and v, p divides every term of this expression except gyhy, so that p 
cannot divide the entire expression. So there is a coefficient of g*h* not divisible by p, a 
contradiction. & 


The technique of the above proof yields the following result. 


2.9.3 Gauss’ Lemma Let f and g be nonconstant polynomials in D[X], where D is a 
unique factorization domain. If c denotes content, then c(fg) = c(f)c(g), up to associates. 
In particular, the product of two primitive polynomials is primitive. 


Proof. By definition of content we may write f = c(f)f* and g = c(g)g* where f* and g* 
are primitive. Thus fg = c(f)c(g)f*g*. It follows that c(f)c(g) divides every coefficient 
of fg, so c(f)c(g) divides c(fg). Now let p be any prime factor of c(fg); then p divides 
c( f)c(g) f*g*, and the proof of (2.9.2) shows that either p divides every coefficient of c(f) f* 
or p divides every coefficient of c(g)g*. If, say, p divides every coefficient of c(f)f*, then 
(since p is prime) either p divides c(f) or p divides every coefficient of f*. But f* is primitive, 
so that p divides c(f), hence p divides c(f)c(g). We conclude that c(fg) divides c(f)c(g), 
and the result follows. & 


2.9.4 Corollary (of the proof of (2.9.3)) If h is a nonconstant polynomial in D[X] and 
h = ah* where h* is primitive and a € D, then a must be the content of h. 


Proof. Since a divides every coefficient of h, a must divide c(h). If p is any prime factor of 
c(h), then p divides every coefficient of ah*, and as in (2.9.3), either p divides a or p divides 
every coefficient of h*, which is impossible by primitivity of h*. Thus c(h) divides a, and 
the result follows. & 
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Proposition 2.9.2 yields a precise statement comparing irreducibility over D with irre- 
ducibility over F’. 


2.9.5 Proposition Let D be a unique factorization domain with quotient field F’. If f is 
a nonconstant polynomial in DLX], then f is irreducible over D if and only if f is primitive 
and irreducible over F’. 


Proof. If f is irreducible over D, then f is irreducible over F by (2.9.2). If f is not 
primitive, then f = c(f)f* where f* is primitive and c(f) is not a unit. This contradicts 
the irreducibility of f over D. Conversely, if f = gh is a factorization of the primitive 
polynomial f over D, then g and h must be of degree at least 1. Thus neither g nor h is a 
unit in F[X], so f = gh is a factorization of f over F’. & 


Here is another basic application of (2.9.2). 
2.9.6 Theorem If R is a unique factorization domain, so is R[X]. 


Proof. If f € R[X], f 4 0, then f can be factored over the quotient field F as f = fi fo--- fr, 
where the f; are irreducible polynomials in F[X]. (Recall that F'[X] is a Euclidean domain, 
hence a unique factorization domain.) By (2.9.2), for some nonzero A; € F’ we may write 
f = nf fe-+: fe) with Afi and Ay fo--- fe in R[X]. Again by (2.9.2), we have 
Nf fe = Py these te = Copies yy eed) with Oefoand AS Aes fee 
R[X]. Continuing inductively, we express f as Ree Mi fi where the r;f; are in R[X] and 
are irreducible over F. But A; f; is the product of its content and a primitive polynomial 
(which is irreducible over F’, hence over R by (2.9.5)). Furthermore, the content is either a 
unit or a product of irreducible elements of the UFD R, and these elements are irreducible 
in R[X] as well. This establishes the existence of a factorization into irreducibles. 

Now suppose that f = g,---gp = h,---h,, where the g; and h; are nonconstant ir- 
reducible polynomials in R[X]. (Constant polynomials cause no difficulty because R is a 
UFD.) By (2.9.5), the g; and h; are irreducible over F’, and since F[X] is a UFD, we have 
r = s and, after reordering if necessary, g; and h; are associates (in F'|X]) for each 1. Now 
g; = c,h; for some constant c; € F, and we have c; = a;/b; with a;,b; € R. Thus b;g; = ajhi, 
with g; and h; primitive by (2.9.5). By (2.9.4), bjg; has content b; and a;h; has content a;. 
Therefore a; and b; are associates, which makes c; a unit in R, which in turn makes g; and 
h; associates in RLX], proving uniqueness of factorization. & 


The following result is often used to establish irreducibility of a polynomial. 


2.9.7 Eisenstein’s Irreducibility Criterion Let R be a UFD with quotient field F’, and 
let f(X) =a,X" +---+a,X + a9 be a polynomial in R[X], with n > 1 and a, 4 0. If p is 
prime in R, p divides a; for 0 <i <n, but p does not divide a, and p? does not divide ao, 
then f is irreducible over F’. Thus by (2.9.5), if f is primitive then f is irreducible over R. 


Proof. If we divide f by its content to produce a primitive polynomial f*, the hypothesis 
still holds for f*. (Since p does not divide a,, it is not a prime factor of c(f), so it must 
divide the i*” coefficient of f* for 0 < i <n.) If we can prove that f* is irreducible over 
R, then by (2.9.5), f* is irreducible over F', and therefore so is f. Thus we may assume 
without loss of generality that f is primitive, and prove that f is irreducible over R. 
Assume that f = gh, with g(X) = go +--:+9-X" and h(X) = hot+---+h,X*. If 
r = 0 then go divides all coefficients a; of f, so go divides c(f), hence g(= go) is a unit. 
Thus we may assume that r > 1, and similarly s > 1. By hypothesis, p divides ag = goho 
but p? does not divide ag, so p cannot divide both gy and ho. Assume that p fails to divide 
ho, so that p divides go; the argument is symmetrical in the other case. Now g-hs = Gn, 
and by hypothesis, p does not divide a,,, so that p does not divide g,. Let i be the smallest 
integer such that p does not divide g;; then 1 <i<r<n (sincer+s=nand s>1). Now 


a, = gohi + hi-1 + +++ + giho 
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and by choice of 2, p divides go,...,9;-1. But p divides the entire sum a;, so p must divide 
the last term g;ho. Consequently, either p divides g;, which contradicts the choice of 7, or p 
divides ho, which contradicts our earlier assumption. Thus there can be no factorization of 
f as a product of polynomials of lower degree, in other words, f is irreducible over R. & 


Problems For Section 2.9 


1. (The rational root test, which can be useful in factoring a polynomial over Q.) 

Let f(X) =a,X" +---+a,X + a9 € Z[X]. If f has a rational root u/v where wu and v are 
relatively prime integers and v 4 0, show that v divides a, and wu divides ao. 

2. Show that for every positive integer n, there is at least one irreducible polynomial of 
degree n over the integers. 

3. If f(X) € ZX] and p is prime, we can reduce all coefficients of f modulo p to obtain 
a new polynomial f,(X) € Z,[X]. If f is factorable over Z, then f, is factorable over Z,. 
Therefore if f, is irreducible over Z,, then f is irreducible over Z. Use this idea to show 
that the polynomial X3 + 27X?+5X +97 is irreducible over Z. (Note that Eisenstein does 
not apply.) 
4. If we make a change of variable X = Y + c in the polynomial f(X), the result is a new 
polynomial g(Y) = f(Y +c). If g is factorable over Z, so is f since f(X) = g(X —c). Thus 
if f is irreducible over Z, so is g. Use this idea to show that X*+4X°46X?+4X44 is 
irreducible over Z. 

5. Show that in Z[X], the ideal < n, X >,n > 2, is not principal, and therefore Z[X] is a 
UFD that is not a PID. 

6. Show that if F is a field, then F[X,Y], the set of all polynomials }*a;;X'Y’, aij € F, 
is not a PID since the ideal < X,Y > is not principal. 

7. Let f(X,Y) = X?+Y?+41€C[X,Y], where C is the field of complex numbers. Write 
f as Y? + (X? +1) and use Eisenstein’s criterion to show that f is irreducible over C. 

8. Show that f(X,Y) = X° + Y?+1 is irreducible over C. 
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Field Fundamentals 


3.1 Field Extensions 


If F is a field and F[X] is the set of all polynomials over F, that is, polynomials with 
coefficients in F’, we know that FX] is a Euclidean domain, and therefore a principal ideal 
domain and a unique factorization domain (see Sections 2.6 and 2.7). Thus any nonzero 
polynomial f in FLX] can be factored uniquely as a product of irreducible polynomials. 
Any root of f must be a root of one of the irreducible factors, but at this point we have 
no concrete information about the existence of roots and how they might be found. For 
example, X?+ 1 has no real roots, but if we consider the larger field of complex numbers, 
we get two roots, +7 and —7. It appears that the process of passing to a larger field may 
help produce roots, and this turns out to be correct. 


3.1.1 Definitions 


If F and EF are fields and F' C E, we say that E is an extension of F', and we write F < E, 
or sometimes E’/F’. 

If E is an extension of F’, then in particular E is an abelian group under addition, and 
we may multiply the “vector” « € E by the “scalar” ’ € F, and the axioms of a vector 
space are satisfied. Thus if F < E, then E is a vector space over F’. The dimension of this 
vector space is called the degree of the extension, written [E: F]. If [EZ : F] =n < oo, we 
say that E is a finite extension of F’, or that the extension E'/F is finite, or that E is of 
degree n over F. 

If f is a nonconstant polynomial over the field F, and f has no roots in F’, we can 
always produce a root of f in an extension field of F’. We do this after a preliminary 
result. 


3.1.2 Lemma 


Let f: F — E be a homomorphism of fields, i.e., f(a+6) = f(a) + f(0), f(ab) = f(a) f(b) 
(all a,b € F), and f(17) =1g. Then f is a monomorphism. 
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Proof. First note that a field F has no ideals except {0} and F. For if a is a nonzero 
member of the ideal J, then ab = 1 for some b € F,, hence 1 € J, and therefore I = F. 
Taking I to be the kernel of f, we see that I cannot be all of F because f(1) 4 0. Thus 
I must be {0}, so that f is injective. & 


3.1.3. Theorem 


Let f be a nonconstant polynomial over the field Ff’. Then there is an extension E/F' and 
an element a € F such that f(a) = 0. 


Proof. Since f can be factored into irreducibles, we may assume without loss of generality 
that f itself is irreducible. The ideal J = (f(X)) in FX] is prime (see (2.6.1)), in fact 
maximal (see (2.6.9)). Thus EB = F[X]/TJ is a field by (2.4.3). We have a problem at this 
point because F' need not be a subset of &, but we can place an isomorphic copy of F 
inside EF via the homomorphism h: a > a+ I; by (3.1.2), h is a monomorphism, so we 
may identify F with a subfield of E. Now let a = X +]; if f(X) = ap +a, X +---+a,X", 
then 


f(a) = (ag +L) +ay(X +2) +--+ + a,(X +1)” 
= (a9 +a,X +--+ +a,X") +1 
= f(X)+I 


which is zeroin FE. & 


The extension E is sometimes said to be obtained from F by adjoining a root a of f. 
Here is a further connection between roots and extensions. 


3.1.4 Proposition 


Let f and g be polynomials over the field F. Then f and g are relatively prime if and 
only if f and g have no common root in any extension of F’. 


Proof. If f and g are relatively prime, their greatest common divisor is 1, so there are 
polynomials a(X) and b(X) over F’ such that a(X)f(X) + b(X)g(X) = 1. If aisa 
common root of f and g, then the substitution of a for X yields 0 = 1, a contradiction. 
Conversely, if the greatest common divisor d(X) of f(X) and g(X) is nonconstant, let E 
be an extension of F in which d(X) has a root a (E exists by (3.1.3)). Since d(X) divides 
both f(X) and g(X), a is a common root of f andgin E. & 


3.1.5 Corollary 


If f and g are distinct monic irreducible polynomials over F’, then f and g have no common 
roots in any extension of F’. 


Proof. If h is a nonconstant divisor of the irreducible polynomials f and g, then up 
to multiplication by constants, h coincides with both f and g, so that f is a constant 
multiple of g. This is impossible because f and g are monic and distinct. Thus f and g 
are relatively prime, and the result follows from (3.1.4). & 


3.1. FIELD EXTENSIONS 3 


If E is an extension of F and a € E is a root of a polynomial f € FLX], it is often 
of interest to examine the field F(a) generated by F and a, in other words the smallest 
subfield of E containing F' and a (more precisely, containing all elements of F’ along 
with a). The field F(a) can be described abstractly as the intersection of all subfields of 
FE containing F' and a, and more concretely as the collection of all rational functions 


ag + a1 + +++ + Ama™ 
bo + bia+-+++ bra” 


with a;,b; € Fym,n =0,1,..., and bp + bia +---+b,a" #0. In fact there is a much 
less complicated description of F(a), as we will see shortly. 


3.1.6 Definitions and Comments 


If E is an extension of F’, the element a € E is said to be algebraic over F is there is a 
nonconstant polynomial f € FX] such that f(a) = 0; if a is not algebraic over F, it is 
said to be transcendental over F’. If every element of F is algebraic over F’, then E is said 
to be an algebraic extension of F. 

Suppose that a € E is algebraic over F’, and let J be the set of all polynomials g 
over F' such that g(a) = 0. If gi: and gz belong to I, so does g; + go, and if g € I and 
c € F(X], then cg € I. Thus I is an ideal of F[X], and since FX] is a PID, I consists of 
all multiples of some m(X) € FX]. Any two such generators must be multiples of each 
other, so if we require that m(X) be monic, then m(X) is unique. The polynomial m(X) 
has the following properties: 


(1) If g € F[X], then g(a) = 0 if and only if m(X) divides g(X). 
(2) m(X) is the monic polynomial of least degree such that m(a) = 0. 
) 


Property (1) follows because g(a) = 0 iff g(X) € I, and I = (m(X)), the ideal generated 
by m(X). Property (2) follows from (1). To prove (3), note that if m(X) = h(X)k(X) 
with deg h and degk less than deg m, then either h(a) = 0 or k(a) = 0, so that by (1), 
either h(X) or k(X) is a multiple of m(X), which is impossible. Thus m(X) is irreducible, 
and uniqueness of m(X) follows from (3.1.5). 

The polynomial m(X) is called the minimal polynomial of a over F’', sometimes written 
as min(a, F’). 


3.1.7 Theorem 


If a € E is algebraic over F and the minimal polynomial m(X) of @ over F’ has degree 
n, then F(a) = F[a], the set of polynomials in a with coefficients in F’. In fact, Fla] 
is the set F,-1[a] of all polynomials of degree at most n — 1 with coefficients in F’, and 
l,a,...,a"~' form a basis for the vector space F[a] over the field F. Consequently, 
[F(a): F] =n. 


Proof. Let f(X) be any nonzero polynomial over F' of degree n — 1 or less. Then since 
m(X) is irreducible and deg f < degm, f(X) and m(X) are relatively prime, and there 
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are polynomials a(X) and b(X) over F' such that a(X)f(X) + b(X)m(X) = 1. But 
then a(a)f(a@) = 1, so that any nonzero element of F,_1[a] has a multiplicative inverse. 
It follows that F,-1[a] is a field. (This may not be obvious, since the product of two 
polynomials of degree n—1 or less can have degree greater than n—1, but if deg g > n—1, 
then divide g by m to get g(X) = q(X)m(X) +r(X) where degr(X) < degm(X) =n. 
Replace X by a to get g(a) = r(a) € F,-1[a]. Less abstractly, if m(a) = a2 +a+1=0, 
then a? = —a — 1, at = —a? — a, and so on.) 

Now any field containing F’ and a@ must contain all polynomials in a, in particular 
all polynomials of degree at most n — 1. Therefore F,,-1{a] C Fla] C F(a). But F(a) 
is the smallest field containing F and a, so F(a) C F,~-1[a], and we conclude that 
F(a) = Fla] = F,-1[a]. Finally, the elements 1,a,...,@”~! certainly span F;,_;[a], and 
they are linearly independent because if a nontrivial linear combination of these elements 
were zero, we would have a nonzero polynomial of degree less than that of m(X) with a 
as a root, contradicting (2) of (3.1.6). de 


We now prove a basic multiplicativity result for extensions, after a preliminary dis- 
cussion. 
3.1.8 Lemma 


Suppose that F < k < E, the elements a;,7 € J, form a basis for E over K, and the 
elements 3;,7 € J, form a basis for K over F. (I and J need not be finite.) Then the 
products a;8;,7 € I,j € J, form a basis for F over F’. 


Proof. If y € E, then y is a linear combination of the a; with coefficients a; € K, and 
each a; is a linear combination of the @; with coefficients b;; € F’. It follows that the a;{; 
span E over F’. Now if }7; ; Aijai8; = 0, then D7; Aijai = 0 for all 7, and consequently 
ij = 0 for all i, 7, and the a,{; are linearly independent. & 


3.1.9 The Degree is Multiplicative 


If fF <K<#, then [E: F] =|E: K|[K: F). In particular, [EF : F] is finite if and only if 
[E : K] and [K : F) are both finite. 


Proof. In (3.1.8), we have [E: K] = |J|, [Kk : F] =|J|, and [B: F)=|J||J|. & 
We close this section by proving that every finite extension is algebraic. 
3.1.10 Theorem 
If E is a finite extension of F’, then F is an algebraic extension of F’. 
Proof. Let a € E, and let n = [E: F]. Then 1,a,a?,...,a” are n+ 1 vectors in an 


n-dimensional vector space, so they must be linearly dependent. Thus a is a root of a 
nonzero polynomial with coefficients in F’, which means that a is algebraic over F. & 
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Problems For Section 3.1 


1. Let E be an extension of F’, and let S be a subset of FE. If FS) is the subfield of E 
generated by S over F’,, in other words, the smallest subfield of E containing F and S, 
describe F'(S) explicitly, and justify your characterization. 

2. If for each i € I, K; is a subfield of the field E, the composite of the K; (notation 
\, Ki) is the smallest subfield of E containing every K;. As in Problem 1, describe 
the composite explicitly. 

3. Assume that a is algebraic over F’, with [Fla] : F] = n. If 6 € Fla], show that 
[FG]: F] <n, in fact [FG] : F] divides n. 

4. The minimal polynomial of V2 over the rationals Q is X? — 2, by (3) of (3.1.6). Thus 
Q[V2] consists of all numbers of the form ag + a;V/2, where ap and ay are rational. 
By Problem 3, we know that —1+ V2 has a minimal polynomial over Q of degree at 
most 2. Find this minimal polynomial. 


5. If a is algebraic over F' and ( belongs to Fa], describe a systematic procedure for 
finding the minimal polynomial of (@ over F’. 

6. If E/F and the element a € E is transcendental over F’, show that F(a) is isomorphic 
to F(X), the field of rational functions with coefficients in F’. 

7. Theorem 3.1.3 gives one method of adjoining a root of a polynomial, and in fact there 
is essentially only one way to do this. If F is an extension of F and a € E is algebraic 
over F' with minimal polynomial m(X), let I be the ideal (m(X)) C FX]. Show that 
F(a) is isomorphic to F'[|X]/I. [Define y: F[X] — E by y(f(X)) = f(a), and use 
the first isomorphism theorem for rings.] 

8. In the proof of (3.1.3), we showed that if f is irreducible in F[X], then I = (f) isa 
maximal ideal. Show that conversely, if J is a maximal ideal, then f is irreducible. 

9. Suppose that F < EF < L, with a € L. What is the relation between the minimal 
polynomial of a over F and the minimal polynomial of a over E? 

10. If a,,...,@, are algebraic over F', we can successively adjoin the a; to F' to obtain 
the field Flay,...,Q@,] consisting of all polynomials over F' in the a;. Show that 


[Flai,.--,Qn]: F] < | |[F (a): F] < oo 


=: 


w=1 


3.2 Splitting Fields 


If f is a polynomial over the field F’, then by (3.1.3) we can find an extension E, of F 
containing a root a; of f. If not all roots of f lie in E1, we can find an extension EF» 
of FE, containing another root ag of f. If we continue the process, eventually we reach a 
complete factorization of f. In this section we examine this idea in detail. 

If FE is an extension of F' and ay,...,a, € E, we will use the notation F(a,,..., ax) 
for the subfield of E generated by F' and the a;. Thus Fay ,...,a,) is the smallest 
subfield of F containing all elements of F along with the a;. ( “Smallest” means that 
F(a1,...,@%) is the intersection of all such subfields.) Explicitly, F(ai,...,a%) is the 
collection of all rational functions in the a; with nonzero denominators. 
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3.2.1 Definitions and Comments 


If FE is an extension of F and f € FX], we say that f splits over F if f can be written 
as A(X — a1)-+-(X — ax) for some ay,...,a, € E and AE F. 

(There is a subtle point that should be mentioned. We would like to refer to the a; 
as “the” roots of f, but in doing so we are implicitly assuming that if G is an element of 
some extension E’ of FE and f() = 0, then @ must be one of the a;. This follows upon 
substituting 6 into the equation f(X) = A(X — ay)-+-(X — ax) = 0.) 

If K is an extension of F and f € F(X], we say that K is a splitting field for f over 
F if f splits over k but not over any proper subfield of K containing F’. 

Equivalently, K is a splitting field for f over F if f splits over kK and K is generated 
over F' by the roots a1,...,a% of f, in other words, F(ai,...,a,) = K. For if K isa 
splitting field for f, then since f splits over K we have alla; € K, so F(ay,...,ax%) C K. 
But f splits over F(ai,...,@,%), and it follows that F'(a1,...,a@%) cannot be a proper 
subfield; it must coincide with K. Conversely, if f splits over K and F(aj,...,a,) = K, 
let L be a subfield of K containing F. If f splits over L then all a; belong to L, so 
R= Flot 904) CEC Kiso be kK. 

If f € F[X] and f splits over the extension E of F,, then E contains a unique splitting 
field for f, namely F(aj,...,Qx). 


3.2.2 Proposition 
If f € FX] and deg f = n, then f has a splitting field K over F with [K : F] < nl. 


Proof. We may assume that n > 1. (If f is constant, take K = F.) By (3.1.3), F has an 
extension EF, containing a root a, of f, and the extension F'\(a,)/F' has degree at most 
n. (Since f(a 1) = 0, the minimal polynomial of a; divides f; see (3.1.6) and (3.1.7).) We 
may then write f(X) = (X — a1)"!9(X), where a; is not a root of g and degg < n—-1. 
If g is nonconstant, we can find an extension of F'(a,) containing a root a2 of g, and the 
extension F'(a1, a2) will have degree at most n — 1 over F(a 1). Continue inductively and 
use (3.1.9) to reach an extension of degree at most n! containing all the roots of f. & 


If f € FX] and f splits over E, then we may pick any root a of f and adjoin it to F 
to obtain the extension F(a). Roots of the same irreducible factor of f yield essentially 
the same extension, as the next result shows. 


3.2.3. Theorem 


If a and £ are roots of the irreducible polynomial f € F[X] in an extension E of F’, then 
F (qa) is isomorphic to F(3) via an isomorphism that carries a into @ and is the identity 
on F’. 


Proof. Without loss of generality we may assume f monic (if not, divide f by its leading 
coefficient). By (3.1.6), part (3), f is the minimal polynomial of both a and @. By (3.1.7), 
the elements of F(a) can be expressed uniquely as aj + a1a+--:+@n_1a”~1, where the 
a; belong to F' and n is the degree of f. The desired isomorphism is given by 


a9 + a1 + +++ + Gn-10"~! > ag +18 +--+ +Gn-18"". & 
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If f is a polynomial in FLX] and F is isomorphic to the field F’ via the isomorphism (, 
we may regard f as a polynomial over F’. We simply use i to transfer f. Thus if 
f =ao+a,X +---a,X", then f’ = i(f) = i(ao) + i(a1)X +--+ +%(an)X”. There is only 
a notational difference between f and f’, and we expect that splitting fields for f and f’ 
should also be essentially the same. We prove this after the following definition. 


3.2.4 Definition 


If £ and £” are extensions of F' and 7 is an isomorphism of F and E’, we say that 7 is 
an F-isomorphism if i fixes F, that is, if i(a@) = a for every a € F. F-homomorphisms, 
F-monomorphisms, etc., are defined similarly. 


3.2.5 Isomorphism Extension Theorem 


Suppose that F’ and F” are isomorphic, and the isomorphism 7 carries the polynomial 
f € F[X] to f’ € F’[X]. If K is a splitting field for f over F and K’ is a splitting field 
for f’ over F’, then 7 can be extended to an isomorphism of K and K’. In particular, if 
F = F" and i is the identity function, we conclude that any two splitting fields of f are 
F-isomorphic. 


Proof. Carry out the construction of a splitting field for f over F' as in (3.2.2), and perform 
exactly the same steps to construct a splitting field for f’ over F’. At every stage, there is 
only a notational difference between the fields obtained. Furthermore, we can do the first 
construction inside kK and the second inside kK’. But the comment at the end of (3.2.1) 
shows that the splitting fields that we have constructed coincide with K and Kk’. & 


3.2.6 Example 


We will find a splitting field for f(X) = X? — 2 over the rationals Q. 

If a is the positive cube root of 2, then the roots of f are a,a(—5 + is V3) and 
a(—4— i3V3). The polynomial f is irreducible, either by Eisenstein’s criterion or by the 
observation that if f were factorable, it would have a linear factor, and there is no rational 
number whose cube is 2. Thus f is the minimal polynomial of a, so [Q(a) : Q| = 3. Now 
since a and iV/3 generate all the roots of f, the splitting field is K = Q(a,iV3). (We 
regard all fields in this example as subfields of the complex numbers C.) Since iV3 ¢ Q(a) 
(because Q(a) is a subfield of the reals), [Q(a,iV3) : Q(a)] is at least 2. But iv3 is a 
root of X?+3 € Q(a)[X], so the degree of Q(a, iV3) over Q(a) is a most 2, and therefore 
is exactly 2. Thus 


[K : Q| = [Q(a,iv3) : Q = [Q(a,iv3) : Q(a)][Q(a) : Qi =2x 3=6. 


Problems For Section 3.2 


1. Find a splitting field for f(X) = X? —4X +4 over Q 


2. Find a splitting field K for f(X) = X? —2X +4 over Q, and determine the degree of 
K over Q. 
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3. Find a splitting field K for f(X) = X*— 2 over Q, and determine [K : Ql. 


4. Let C be a family of polynomials over F’, and let Kk be an extension of F’. Show that 
the following two conditions are equivalent: 


(a) Each f €C splits over K, but if F < Kk’ < K, then it is not true that each f € C 
splits over K’. 

(b) Each f € C splits over K, and K is generated over F’ by the roots of all the 
polynomials in C. 


If one, and hence both, of these conditions are satisfied, we say that K is a splitting 
field for C over F. 


5. Suppose that K is a splitting field for the finite set of polynomials {f;,..., f,} over F. 
Express K as a splitting field for a single polynomial f over F’. 


6. If m and n are distinct square-free positive integers greater than 1, show that the 
splitting field Q(./m, \/n) of (X? — m)(X? — n) has degree 4 over Q. 


3.3 Algebraic Closures 


If f is a polynomial of degree n over the rationals or the reals, or more generally over 
the complex numbers, then f need not have any rational roots, or even real roots, but we 
know that f always has n complex roots, counting multiplicity. This favorable situation 
can be duplicated for any field F’, that is, we can construct an algebraic extension C' of F 
with the property that any polynomial in C[X] splits over C. There are many ways to 
express this idea. 


3.3.1 Proposition 


If C is a field, the following conditions are equivalent: 


1 


) Every nonconstant polynomial f € CLX] has at least one root in C. 
2) Every nonconstant polynomial f € C[X] splits over C. 
) 
) 


3) Every irreducible polynomial f € C[X] is linear. 


( 
( 
( 
( 


4) C has no proper algebraic extensions. 


If any (and hence all) of these conditions are satisfied, we say that C is algebraically closed. 


Proof. (1) implies (2): By (1) we may write f = (X — a,)g. Proceed inductively to show 
that any nonconstant polynomial is a product of linear factors. 

(2) implies (3): If f is an irreducible polynomial in C/[X], then by (2.9.1), f is non- 
constant. By (2), f is a product of linear factors. But f is irreducible, so there can be 
only one such factor. 

(3) implies (4): Let E be an algebraic extension of C. If a € E, let f be the minimal 
polynomial of a over C. Then f is irreducible and by (3), f is of the form X — a. But 
thena€C,so F=C. 
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(4) implies (1): Let f be a nonconstant polynomial in CX], and adjoin a root a of f 
to obtain C(a), as in (3.1.3). But then C(q) is an algebraic extension of C, so by (4), 
acc. & 


It will be useful to embed an arbitrary field F in an algebraically closed field. 


3.3.2 Definitions and Comments 


An extension C’ of F is an algebraic closure of F if C is algebraic over F' and C is 
algebraically closed. 

Note that C is minimal among algebraically closed extensions of F. For if F< K <C 
and a € C,a ¢ K, then since a is algebraic over F it is algebraic over kK. But since 
a ¢ K, the minimal polynomial of a over K is a nonlinear irreducible polynomial in 
K(X]. By (3) of (3.3.1), & cannot be algebraically closed. 

If C is an algebraic extension of F’, then in order for C to be an algebraic closure of F’ 
it is sufficient that every polynomial in FLX] (rather than C[X]) splits over C. To prove 
this, we will need the following result. 


3.3.3 Proposition 


If FE is generated over F' by finitely many elements aj,...,@, algebraic over F' (so that 
E = F(qj,...,Q@n)), then E is a finite extension of F’. 


Proof. Set Eo = F and Ex, = F(aj,...,a%), 1 < k <n (so E, = EF). Then Ey = 
Ex—1(Qz), where ax is algebraic over F’ and hence over Ex_-1. But by (3.1.7), [Ex : Ex—1] 
is the degree of the minimal polynomial of a, over Ex—1, which is finite. By (3.1.9), 
[E : F] =[]p_ [Ee : Ex-il< co. & 


3.3.4 Corollary 


If E is an extension of F and A is the set of all elements in E that are algebraic over F 
(the algebraic closure of F in E), then A is a subfield of E. 


Proof. If a,@ € A, then the sum, difference, product and quotient (if 6 4 0) of a and 8 
belong to F(a, 3), which is a finite extension of F' by (3.3.3), and therefore an algebraic 
extension of F by (3.1.10). But then a+ 3,a— ,a8 and a/ belong to A, proving that 
Aisafield. & 


3.3.5 Corollary (Transitivity of Algebraic Extensions) 


If E is algebraic over K (in other words, every element of FE is algebraic over K), and Kk 
is algebraic over F’, then E is algebraic over F. 


Proof. Let a € E, and let m(X) = bp + b)X +--+ + bpp X" 1+ X” be the minimal 
polynomial of a over K. The b; belong to K and are therefore algebraic over F. If 
L = F(bo, b1,...,n-1), then by (3.3.3), L is a finite extension of F’. Since the coefficients 
of m(X) belong to L,a is algebraic over L, so by (3.1.7), L(q@) is a finite extension of L. 
By (3.1.9), L(q@) is a finite extension of F’. By (3.1.10), @ is algebraic over F. & 
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Now we can add another condition to (3.3.1). 


3.3.6 Proposition 


Let C' be an algebraic extension of F. Then C is an algebraic closure of F if and only if 
every nonconstant polynomial in FX] splits over C. 


Proof. The “only if” part follows from (2) of (3.3.1), since F C C. Thus assume that 
every nonconstant polynomial in FLX] splits over C. If f is a nonconstant polynomial 
in C[X], we will show that f has at least one root in C, and it will follow from (1) of 
(3.3.1) that C is algebraically closed. Adjoin a root a of f to obtain the extension C(a). 
Then C(qa) is algebraic over C by (3.1.7), and C is algebraic over F' by hypothesis. By 
(3.3.5), C(a@) is algebraic over F’, so a is algebraic over F’. But then a is a root of some 
polynomial g € F[X], and by hypothesis, g splits over C. By definition of “splits” (see 
(3.2.1)), all roots of g lie in C, in particular a € C. Thus f has at least one root inC. & 


To avoid a lengthy excursion into formal set theory, we argue intuitively to establish 
the following three results. (For complete proofs, see the appendix to Chapter 3.) 


3.3.7 Theorem 


Every field F' has an algebraic closure. 


Informal argument. Well-order FX] and use transfinite induction, beginning with the 
field fy = F. At stage f we adjoin all roots of the polynomial f by constructing a 
splitting field for f over the field Fey that has been generated so far by the recursive 
procedure. When we reach the end of the process, we will have a field C' such that every 
polynomial f in FX] splits over C. By (3.3.6), C is an algebraic closure of F. & 


3.3.8 Theorem 


Any two algebraic closures C and C’ of F are F-isomorphic. 


Informal argument. Carry out the recursive procedure described in (3.3.7) in both C 
and C’. At each stage we may use the fact that any two splitting fields of the same 
polynomial are F’-isomorphic; see (3.2.5). When we finish, we have F’-isomorphic algebraic 
closures of F’, say D C C and D’ C C’. But an algebraic closure is a minimal algebraically 
closed extension by (3.3.2), and therefore D=C and D’/=C". & 


3.3.9 Theorem 


If EF is an algebraic extension of F’, C' is an algebraic closure of F’, and 7 is an embedding 
(that is, a monomorphism) of F' into C, then i can be extended to an embedding of FE 
into C. 
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Informal argument. Each a € E is a root of some polynomial in FX], so if we allow a to 
range over all of FE, we get a collection S$ of polynomials in F[X]. Within C, carry out the 
recursive procedure of (3.3.7) on the polynomials in S. The resulting field lies inside C 
and contains an F-isomorphic copy of EF. & 


Problems For Section 3.3 
1. Show that the converse of (3.3.3) holds, that is, if E is a finite extension of F’, then E 
is generated over F' by finitely many elements that are algebraic over F’. 


2. An algebraic number is a complex number that is algebraic over the rational field Q. 
A transcendental number is a complex number that is not algebraic over Q. Show that 
there only countably many algebraic numbers, and consequently there are uncountably 
many transcendental numbers. 


3. Give an example of an extension C'/F' such that C' is algebraically closed but C is not 
an algebraic extension of F’. 


4. Give an example of an extension E’/F such that FE is an algebraic but not a finite 
extension of F’. 


. In the proof of (3.3.7), why is C algebraic over F’? 

. Show that the set A of algebraic numbers is an algebraic closure of Q. 

. If E is an algebraic extension of the infinite field F’, show that |E| = |F'. 

. Show that any set S of nonconstant polynomials in FX] has a splitting field over F’. 


oe ON DD OH 


. Show that an algebraically closed field must be infinite. 


3.4 Separability 


If f is a polynomial in FX], we can construct a splitting field K for f over F, and all 
roots of f must lie in Kk. In this section we investigate the multiplicity of the roots. 


3.4.1 Definitions and Comments 


An irreducible polynomial f € F[X] is separable if f has no repeated roots in a splitting 
field; otherwise f is inseparable. If f is an arbitrary polynomial, not necessarily irreducible, 
then we call f separable if each of its irreducible factors is separable. 

Thus if f(X) = (X — 1)?(X — 3) over Q, then f is separable, because the irreducible 
factors (X — 1) and (X — 3) do not have repeated roots. We will see shortly that over a 
field of characteristic 0 (for example, the rationals), every polynomial is separable. Here 
is a method for testing for multiple roots. 


3.4.2 Proposition 
If 
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let f’ be the derivative of f, defined by 
f'(X) =a) + 2agX +--+ + nay,X™ |. 


[Note that the derivative is a purely formal expression; we completely ignore questions 
about existence of limits. One can check by brute force that the usual rules for differen- 
tiating a sum and product apply]. 

If g is the greatest common divisor of f and f’, then f has a repeated root in a splitting 
field if and only if the degree of g is at least 1. 


Proof. If f has a repeated root, we can write f(X) = (X—a)"h(X) where r > 2. Applying 
the product rule for derivatives, we see that (X — aq) is a factor of both f and f’, and 
consequently deg g > 1. Conversely, if deg g > 1, let a be a root of g in some splitting 
field. Then (X — qa) is a factor of both f and f’. We will show that a is a repeated root 
of f. If not, we may write f(X) = (X — a)h(X) where h(a) 4 0. Differentiate to obtain 
f'(X) = (X — a)h'(X) + h(X), hence f’(a) = h(a) 4 0. This contradicts the fact that 
(X — a) is a factor of f’. 


3.4.3 Corollary 


(1) Over a field of characteristic zero, every polynomial is separable. 


(2) Over a field F' of prime characteristic p, the irreducible polynomial f is inseparable 
if and only if f’ is the zero polynomial. Equivalently, f is a polynomial in X?; we 
abbreviate this as f € F[X?]. 


Proof. (1) Without loss of generality, we can assume that we are testing an irreducible 
polynomial f. The derivative of X” is nX"~}, and in a field of characteristic 0, n cannot 
be 0. Thus f’ is a nonzero polynomial whose degree is less than that of f. Since f is 
irreducible, the gcd of f and f’ is either 1 or f, and the latter is excluded because f 
cannot possibly divide f’. By (3.4.2), f is separable. 

(2) If f’ # 0, the argument of (1) shows that f is separable. If f’ = 0, then 
ecd(f, f’) = f, so by (3.4.2), f is inseparable. In characteristic p, an integer n is zero if 
and only if n is a multiple of p, and it follows that f’ = 0 iff f ¢ F[X?]. & 


By (3.4.3), part (1), every polynomial over the rationals (or the reals or the complex 
numbers) is separable. This pleasant property is shared by finite fields as well. First 
note that a finite field F' cannot have characteristic 0, since a field of characteristic 0 
must contain a copy of the integers (and the rationals as well), and we cannot squeeze 
infinitely many integers into a finite set. Now recall the binomial expansion modulo p, 
which is simply (a + b)? = a? + bP, since p divides (?) for 1 < k < p—1. [By induction, 
(a+ b)?" =a?" +b?” for every positive integer n.] Here is the key step in the analysis. 


3.4.4 The Frobenius Automorphism 


Let F' be a finite field of characteristic p, and define f: F — F by f(a) =a”. Then f is 
an automorphism. In particular, if a € F then a = G? for some @ € F. 
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Proof. We have f(1) = 1 and 


flat B) = (a+ 8)? = a? + BP = f(a) + f (9), 
f(aB) = (a8)? = a? 6? = f(a) f(P) 


so f is a monomorphism. But an injective function from a finite set to itself is automati- 
cally surjective, and the result follows. d& 


3.4.5 Proposition 


Over a finite field, every polynomial is separable. 


Proof. Suppose that f is an irreducible polynomial over the finite field F' with repeated 
roots in a splitting field. By (3.4.3), part (2), f(X) has the form ap + a, X? +--+ +a,X"? 
with the a; € F. By (3.4.4), for each i there is an element b; € F such that b? = a;. But 
then 


(bo + BX +--+ bn X”)? = OF +OPXP +--+ DPX"? = f(X) 
which contradicts the irreducibility of f. & 


Separability of an element can be defined in terms of its minimal polynomial. 


3.4.6 Definitions and Comments 


If EF is an extension of F and a € E, then a is separable over F if a is algebraic over F 
and min(a, F’) is a separable polynomial. If every element of EF is separable over F’, we say 
that E is a separable extension of F or the extension E'/F is separable or E is separable 
over F’. By (3.4.3) and (3.4.5), every algebraic extension of a field of characteristic zero 
or a finite field is separable. 


3.4.7 Lemma 


If F< K < E and E is separable over F’, then K is separable over F' and F is separable 
over K. 


Proof. Since K is a subfield of E, K/F is separable. If a € E, then since a is a root of 
min(a, F’), it follows from (1) of (3.1.6) that min(a, K’) divides min(a, F’). By hypothesis, 
min(q@, F’) has no repeated roots in a splitting field, so neither does min(a, K). Thus E/Kk 
is separable. & 


The converse of (3.4.7) is also true: If K/F and E/K are separable, then E/F is 
separable. Thus we have transitivity of separable extensions. We will prove this (for finite 
extensions) in the exercises. 

In view of (3.4.6), we can produce many examples of separable extensions. Inseparable 
extensions are less common, but here is one way to construct them. 
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3.4.8 Example 


Let F = F,(t) be the set of rational functions (in the indeterminate t) with coefficients 
in the field with p elements (the integers mod p). Thus an element of F’ looks like 

ao tayt+---+amt™ 

bo + byt +--+ + dnt” © 


with the a; and b; in F,. Adjoin ¥/t, that is, a root of X? — t, to create the extension E. 
Note that X? — t is irreducible by Eisenstein, because t is irreducible in F,[t]. (The 
product of two nonconstant polynomials in t cannot possibly be t.) The extension E'/F 
is inseparable, since 


XP t= XP — (Vor = (X- Hy, 


which has multiple roots. 


Problems For Section 3.4 


1. Give an example of a separable polynomial f whose derivative is zero. (In view of 
(3.4.3), f cannot be irreducible.) 

2. Let a € EF, where E is an algebraic extension of a field F of prime characteristic p. 
Let m(X) be the minimal polynomial of a over the field F(a”). Show that m(X) 
splits over F, and in fact @ is the only root, so that m(X) is a power of (X — a). 

3. Continuing Problem 2, if a is separable over the field F(a”), show that a € F(a”). 

4. A field F' is said to be perfect if every polynomial over F’ is separable. Equivalently, 
every algebraic extension of F is separable. Thus fields of characteristic zero and 
finite fields are perfect. Show that if F has prime characteristic p, then F is perfect 
if and only if every element of F is the p*” power of some element of F’. For short we 
write F = FP, 

In Problems 5-8, we turn to transitivity of separable extensions. 

5. Let E£ be a finite extension of a field F' of prime characteristic p, and let K = F(E?) 
be the subfield of E obtained from F' by adjoining the p*” powers of all elements of 
E. Show that F(E?) consists of all finite linear combinations of elements in E? with 
coefficients in F’. 

6. Let E be a finite extension of the field F' of prime characteristic p, and assume that 
E = F(E?). If the elements yi,...,y, € E are linearly independent over F’', show 
that y?,...,y® are linearly independent over F’. 

7. Let E be a finite extension of the field F' of prime characteristic p. Show that the 
extension is separable if and only if FH = F(E?). 

8 If F< Kk < E with [E: F] < «, with E separable over K and K separable over F, 
show that FE is separable over F’. 

9. Let f be an irreducible polynomial in FX], where F has characteristic p > 0. Express 
f(X) as g(X”"), where the nonnegative integer m is a large as possible. (This makes 
sense because X? = X, so m = 0 always works, and f has finite degree, so m is 
bounded above.) Show that g is irreducible and separable. 
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10. Continuing Problem 9, if f has only one distinct root a, show that a?” € F. 


11. If E/F, where char F' = p > 0, and the element a € F is algebraic over F’, show that 
the minimal polynomial of a over F has only one distinct root if and only if a?" € F 
for some nonnegative integer n. (In this case we say that a is purely inseparable over 


3.5 Normal Extensions 


Let E'/F be a field extension. In preparation for Galois theory, we are going to look at 
monomorphisms defined on EF, especially those which fix F. First we examine what an 
F-monomorphism does to the roots of a polynomial in F'[X]. 


3.5.1 Lemma 


Let o: E > E be an F-monomorphism, and assume that the polynomial f € FX] splits 
over F. If a is a root of f in E, then so is o(a). Thus o permutes the roots of f. 


Proof. If bp) + bia+-+--+b,a” =0, with the 6; € F, apply o and note that since a is an 
F-monomorphism, o(b;) = b; and o(a’) = (o(a))*. Thus 


bo + big (a) +--+ + bn(a(a))” =0. & 


Now let C’ be an algebraic closure of EF. It is convenient to have C available because 
it will contain all the roots of a polynomial f € E[X], even if f does not split over E. We 
are going to count the number of embeddings of F in C' that fix F’, that is, the number 
of F-monomorphisms of E into C. Here is the key result. 


3.5.2 Theorem 


Let E/F be a finite separable extension of degree n, and let o be an embedding of F 
in C. Then o extends to exactly n embeddings of F in C; in other words, there are 
exactly n embeddings 7 of E in C' such that the restriction t|~ of + to F coincides 
with o. In particular, taking o to be the identity function on F’, there are exactly n 
F-monomorphisms of F into C. 


Proof. An induction argument works well. If n = 1 then & = F and there is nothing to 
prove, so assume n > 1 and choose an element a that belongs to EF but not to F. If f is 
the minimal polynomial of a over F’, let g = o(f). (This is a useful shorthand notation, 
indicating that if a; is one of the coefficients of f, the corresponding coefficient of g is 
o(a;).) Any factorization of g can be translated via the inverse of o to a factorization 
of f, so g is separable and irreducible over the field o(F’). If @ is any root of g, then there 
is a unique isomorphism of F(a) and (o(F’))(@) that carries a into @ and coincides with 
o on F, Explicitly, 


bo + bia +--+ +b-a” > o(bp) + 0(b1) 8B +--+ + 0(b,) 8". 
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Now if deg g = r, then [F(a) : F] = deg f = deg g = r as well, so by (3.1.9), [E': F(a)] = 
n/r <n. By separability, g has exactly r distinct roots in C, so there are exactly r 
possible choices of 3. In each case, by the induction hypothesis,the resulting embedding 
of F(a) in C has exactly n/r extensions to embeddings of E in C. This produces n 
distinct embeddings of F in C extending o. But if 7 is any embedding of F in C' that 
extends o, then just as in (3.5.1), 7 must take a to a root of g, i.e., to one of the (’s. 
If there were more than n possible 7’s, there would have to be more than n/r possible 
extensions of at least one of the embeddings of F(a) in C. This would contradict the 
induction hypothesis. d& 


3.5.3 Example 


Adjoin the positive cube root of 2 to the rationals to get E = Q(*/2). The roots of the 
irreducible polynomial f(X) = X? — 2 are W/2, wi/2 and w? V2, where w = e?27/3 = 
—} + si 3 and w2 = e#47/3 = —4 _ siv. 

Notice that the polynomial f has a root in EF but does not split in E (because the 
other two roots are complex and F consists entirely of real numbers). We give a special 


name to extensions that do not have this annoying drawback. 


3.5.4 Definition 


The algebraic extension E/F is normal (we also say that E is normal over F’) if every 
irreducible polynomial over F' that has at least one root in F splits over EF. In other 
words, if a € E, then all conjugates of a over F (i.e., all roots of the minimal polynomial 
of a over F’) belong to E. 

Here is an equivalent condition. 


3.5.5 Theorem 


The finite extension F/F is normal if and only if every F-monomorphism of E into an 
algebraic closure C is actually an F-automorphism of E. (The hypothesis that E/F is 
finite rather than simply algebraic can be removed, but we will not need the more general 
result.) 


Proof. If E/F is normal, then as in (3.5.1), an F-monomorphism 7 of £ into C must map 
each element of E to one of its conjugates. Thus by hypothesis, 7(£) C E. But r(£) is 
an isomorphic copy of EF’ , so it must have the same degree as FE over F’. Since the degree 
is assumed finite, we have 7(£) = E. (All we are saying here is that an m-dimensional 
subspace of an m-dimensional vector space is the entire space.) Conversely, let a € EF, and 
let G@ be any conjugate of a over F’. As in the proof of (3.5.2), there is an F-monomorphism 
of F into C that carries a to @. If all such embeddings are F-automorphisms of E, we 
must have @ € E, and we conclude that F is normal over F. & 


3.5.6 Remarks 


In (3.5.2) and (3.5.5), the algebraic closure can be replaced by any fixed normal extension 
of F' containing E; the proof is the same. Also, the implication 7(£) C E > 7(E) = E 
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holds for any F-monomorphism 7 and any finite extension E'/F’; normality is not involved. 
The next result yields many explicit examples of normal extensions. 


3.5.7 Theorem 


The finite extension E’/F is normal if and only if E is a splitting field for some polynomial 
fc F[X]. 


Proof. Assume that E is normal over F’. Let aj,...,Q@, be a basis for F over F’, and 
let f; be the minimal polynomial of a; over F,i = 1,...,n. Since f; has a root a; in E, 
fi splits over E, hence so does f = f,--- fn. If f splits over a field K with FC K CE, 
then each a; belongs to K, and therefore K must coincide with F. Thus E£ is a splitting 
field for f over F’. Conversely, let EF be a splitting field for f over F’, where the roots of f 
are a;,i=1,...,n. Let 7 be an F-monomorphism of £ into an algebraic closure. As in 
(3.5.1), 7 takes each a; into another root of f, and therefore + takes a polynomial in the 
a; to another polynomial in the a;. But F(a1,...,@,) = E, so T(E) C E. By (3.5.6), 7 
is an automorphism of E, so by (3.5.5), E/F is normal. & 


3.5.8 Corollary 
Let F < K < E, where E is a finite extension of F’. If E/F is normal, so is E/K. 


Proof. By (3.5.7), E is a splitting field for some polynomial f € FX], so that E is 
generated over F' by the roots of f. But then f € K[X] and E is generated over K by 
the roots of f. Again by (3.5.7), E/K is normal. & 


3.5.9 Definitions and Comments 


If E/F is normal and separable, it is said to be a Galois extension; we also say that E is 
Galois over F. It follows from (3.5.2) and (3.5.5) that if E/F is a finite Galois extension, 
then there are exactly [E : F'] F-automorphisms of E. If E/F is finite and separable but 
not normal, then at least one F-embedding of FE into an algebraic closure must fail to be 
an automorphism of EF. Thus in this case, the number of F-automorphisms of FE is less 
than the degree of the extension. 

If E/F is an arbitrary extension, the Galois group of the extension, denoted by 
Gal(E/F), is the set of F-automorphisms of FE. (The set is a group under composition of 
functions.) 


3.5.10 Example 


Let E = Q(¥2), as in (3.5.3). The Galois group of the extension consists of the identity 
automorphism alone. For any Q-monomorphism o of E must take W2 into a root of 
X%— 2. Since the other two roots are complex and do not belong to E, ¥/2 must map to 
itself. But o is completely determined by its action on 2, and the result follows. 

If E/F is not normal, we can always enlarge FE to produce a normal extension of F. 
If C is an algebraic closure of E, then C' contains all the roots of every polynomial in 
F |X], so C/F is normal. Let us try to look for a smaller normal extension. 
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3.5.11 The Normal Closure 


Let EF be a finite extension of F’, say FE = F(a1,...,Q,). If N D Eis any normal extension 
of F’, then N must contain the a; along with all conjugates of the a;, that is, all roots of 
min(a;, F),i =1,...,n. Thus if f is the product of these minimal polynomials, then N 
must contain the splitting field K for f over F. But K/F is normal by (3.5.7), so K must 
be the smallest normal extension of F' that contains E. It is called the normal closure of 
E over F. 

We close the section with an important result on the structure of finite separable 
extensions. 


3.5.12 Theorem of the Primitive Element 


If E/F is a finite separable extension, then E = F(a) for some a € E. We say that a is 
a primitive element of E over F. 


Proof. We will argue by induction on n = [E:: F]. If n = 1 then E = F and we can take 
a to be any member of F’. If n > 1, choose a € E'\ F. By the induction hypothesis, there 
is a primitive element 6 for E over F(a), so that E = F(a,B). We are going to show 
that if c € F' is properly chosen, then F = F(a+c(). Now by (3.5.2), there are exactly n 
F-monomorphisms of F into an algebraic closure C’, and each of these maps restricts to 
an F-monomorphism of F(a +c) into C. If F(a+c@) 4 E, then [F(a+ cf): F] <n, 
and it follows from (3.5.2) that at least two embeddings of E, say o and 7, must coincide 
when restricted. Therefore 


a(a) + co(B) = r(a) + c7(), 


hence 
c= aha) esele) (1) 


(If 7(8) = o(8) then by the previous equation, T(a) = o(a). But an F-embedding 
of E is determined by what it does to a@ and (, hence o = 7, a contradiction.) Now 
an F-monomorphism must map a to one of its conjugates over F’, and similarly for /. 
Thus there are only finitely many possible values for the ratio in (1). If we select c to 
be different from each of these values, we reach a contradiction of our assumption that 
F(a+c) 4 E. The proof is complete if F' is an infinite field. We must leave a gap here, 
to be filled later (see (6.4.4)). If F is finite, then so is FE (since F is a finite-dimensional 
vector space over fF’). We will show that the multiplicative group of nonzero elements of 
a finite field F is cyclic, so if a is a generator of this group, then E = F(a). & 


Problems For Section 3.5 


1. Give an example of fields F < K < E such that E/F is normal but K/F is not. 


2. Let E = Q(/a), where a is an integer that is not a perfect square. Show that E'/Q is 
normal. 
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3. Give an example of fields F < K < E such that E/K and K/F are normal, but E/F 
is not. Thus transitivity fails for normal extensions. 


4. Suppose that in (3.5.2), the hypothesis of separability is dropped. State and prove an 
appropriate conclusion. 

5. Show that EF = Q(V2, V3) is a Galois extension of Q. 

6. In Problem 5, find the Galois group of E/Q. 

7. Let E be a finite extension of F’, and let Kk be a normal closure (= minimal normal 
extension) of F over F’, as in (3.5.11). Is K unique? 


8. If E; and E> are normal extensions of F’, show that E,; 9 E>, is normal over F’. 


Appendix To Chapter 3 


In this appendix, we give a precise development of the results on algebraic closure treated 
informally in the text. 


A3.1 Lemma 


Let E be an algebraic extension of F’, and let 0: E — E be an F-monomorphism. Then 
a is surjective, hence o is an automorphism of E. 


Proof. Let a € E, and let f(X) be the minimal polynomial of a over F’. We consider the 
subfield L of E generated over F' by the roots of f that lie in E. Then L is an extension 
of F that is finitely generated by algebraic elements, so by (3.3.3), D/F is finite. As in 
(3.5.1), o takes a root of f to a root of f, so o(Z) C L. But [L: F] = [o(L): F] < x 
(o maps a basis to a basis), and consequently o(L) = L. Buta e€L,soaea(L). & 


The following result, due to Artin, is crucial. 


A3.2 Theorem 
If F is any field, there is an algebraically closed field F containing F’. 


Proof. For each nonconstant polynomial f in FLX], we create a variable X(f). If T is the 
collection of all such variables, we can form the ring F'[T] of all polynomials in all possible 
finite sets of variables in T, with coefficients in F’. Let I be the ideal of F'[T] generated 
by the polynomials f(X(f)), f € F[X]. We claim that J is a proper ideal. If not, 
then 1 € J, so there are finitely many polynomials f,,..., f, in F[X] and polynomials 
hy,---,hn in F[T] such that 77, hi fi(X(fi)) = 1. Now only finitely many variables 
X; = X(fi),i = 1,...,m, can possibly appear in the h;, so we have an equation of the 


n 


S 7 hi(X1,-.-) Xm) fi( Xi) = 1 (1) 


i=l 
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where m > n. Let L be the extension of F formed by successively adjoining the roots of 
fi,---, fn. Then each f; has a root a; € L. If we set a; = 0 for n <i < _m and then set 
Xj; = a; for each ¢ in (1), we get 0 = 1, a contradiction. 

Thus the ideal I is proper, and is therefore contained in a maximal ideal M. Let EF, 
be the field F[T]/M. Then E, contains an isomorphic copy of F’, via the map taking 
a€Ftoa+Me EF. (Note that if ae M,a4 0, then 1 = a~!a € M, a contradiction.) 
Consequently, we can assume that F < FE). If f is any nonconstant polynomial in FX], 
then X(f)+M € Fy and f(X(f)+M) = f(X(f)) + M =0 because f(X(f)) Ee 1 CM. 

Iterating the above procedure, we construct a chain of fields F < FE, < Ey <--+ such 
that every polynomial of degree at least 1 in E,,[X] has a root in E,4,. The union F of 
all the E,, is a field, and every nonconstant polynomial f in E[X] has all its coefficients 
in some F,,. Therefore f has a root in Fn; C FE. & 


A3.3 Theorem 


Every field F' has an algebraic closure. 


Proof. By (A3.2), F has an algebraically closed extension L. If E is the algebraic closure 
of F in L (see 3.3.4), then E'/F is algebraic. Let f be a nonconstant polynomial in E[X]. 
Then f has a root a in L (because L is algebraically closed). We now have a algebraic 
over F' (because f € E[X]), and E algebraic over F’. As in (3.3.5), a is algebraic over F, 
hence a € E. By (3.3.1), E is algebraically closed. & 


A3.4 Problem 


Suppose that o is a monomorphism of F' into the algebraically closed field L. Let E be 
an algebraic extension of F', and a an element of E with minimal polynomial f over F. 
We wish to extend o to a monomorphism from F(a) to L. In how many ways can this 
be done? 

Let of be the polynomial in (0 F’)[X] obtained from f by applying o to the coefficients 
of f. Any extension of f is determined by what it does to a, and as in (3.5.1), the image 
of a is a root of of. Now the number of distinct roots of f in an algebraic closure of F, 
call it t, is the same as the number of distinct roots of of in L; this follows from the 
isomorphism extension theorem (3.2.5). Thus the number of extensions is at most t. 
But if @ is any root of of, we can construct an extension of o by mapping the element 
h(a) € F(a) to (oh)(8); in particular, a is mapped to 3. To show that the definition 
makes sense, suppose that hi(a) = ho(a). Then (hi — h2)(a) = 0, so f divides hy — h2 
in FX]. Consequently, of divides chy — chg in (oc F)[X], so (oh) (8) = (cha) (8). 

We conclude that the number of extensions of o is the number of distinct roots of f 
in an algebraic closure of F’. 

Rather than extend o one element at a time, we now attempt an extension to all of E. 


A3.5 Theorem 


Let o: F — L be a monomorphism, with L algebraically closed. If F is an algebraic 
extension of F’, then o has an extension to a monomorphism 7: EF — L. 
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Proof. Let G be the collection of all pairs (K, 44) where K is an intermediate field between 
F and E and wp is an extension of o to a monomorphism from K to L. We partially 
order G by (Ay, 1) < (Ko, p) iff Ky C Ko and p restricted to Ky coincides with p. Since 
(Fic) € G, we have G # 9. If the pairs (Kj, u;),i € I, form a chain, there is an upper 
bound (K, 44) for the chain, where K is the union of the K; and p coincides with p; on 
each K;. By Zorn’s lemma, G has a maximal element (Ko,7T). If Ko C E, let a€ E\ Ko. 
By (A3.4), 7 has an extension to Ko(q@), contradicting maximality of (Ko,T). & 


A3.6 Corollary 


In (A3.5), if & is algebraically closed and L is algebraic over o(F'), then 7 is an isomor- 
phism. 


Proof. Since E is algebraically closed, so is T(F). Since L is algebraic over o(F'), it is 
algebraic over the larger field 7(£). By (1) => (4) in (3.3.1), L=r(E). & 

A3.7 Theorem 

Any two algebraic closures L and F of a field F' are F'-isomorphic. 


Proof. We can assume that F is a subfield of Z and o: F — L is the inclusion map. By 
(A3.6), o extends to an isomorphism 7 of F and L, and since 7 is an extension of ¢, it is 
an F-monomorphism. & 


A3.8 Theorem (=Theorem 3.3.9) 


If E is an algebraic extension of F and C is an algebraic closure of F’, then any embedding 
of F' into C can be extended to an embedding of E into C. 


Proof. This is a special case of (A3.5). & 


A3.9 Remark 


The argument just given assumes that F is a subfield of C’. This can be assumed without 
loss of generality, by (A3.2), (A3.3) and (A3.7). In other words, we can assume that an 
algebraic closure of F' contains a specified algebraic extension of F’. 


A3.10 Theorem 


Let FE be an algebraic extension of F’, and let L be the algebraic closure of F' containing EF 
[see (A3.9)]. If o is an F-monomorphism from £ to L, then o can be extended to an 
automorphism of LD. 


Proof. We have L algebraically closed and L/E algebraic, so by (A3.5) with E replaced 
by L and F by E, o extends to a monomorphism from L to L, an F-monomorphism by 
hypothesis. The result follows from (A3.1). & 
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Chapter 4 


Module Fundamentals 


4.1 Modules and Algebras 


4.1.1 Definitions and Comments 


A vector space M over a field R is a set of objects called vectors, which can be added, 
subtracted and multiplied by scalars (members of the underlying field). Thus M is an 
abelian group under addition, and for each r € Rand x € M we have an element rx € M. 
Scalar multiplication is distributive and associative, and the multiplicative identity of the 
field acts as an identity on vectors. Formally, 


riaty)=ratry; (rts)t=ratsa; r(sx)=(rs)x; la=ax 


for all z,y € M andr,s € R. A module is just a vector space over a ring. The formal 
definition is exactly as above, but we relax the requirement that R be a field, and instead 
allow an arbitrary ring. We have written the product rx with the scalar r on the left, and 
technically we get a left R-module over the ring R. The axioms of a right R-module are 


(a+y)r=ar+tyr; a(r+s)=ar+as; (xs)r=a(sr), cl=a. 


“Module” will always mean left module unless stated otherwise. Most of the time, there 
is no reason to switch the scalars from one side to the other (especially if the underlying 
ring is commutative). But there are cases where we must be very careful to distinguish 
between left and right modules (see Example 6 of (4.1.3)). 


4.1.2 Some Basic Properties of Modules 


Let M be an R-module. The technique given for rings in (2.1.1) can be applied to establish 
the following results, which hold for any x € M andr € R. We distinguish the zero vector 
Oy from the zero scalar Op. 


(1) TOM =0Onm [rOn _ r(Om + 0m) =r0u +r0u] 


(2) Ort =Oy [Ore = (0R + OR) = One + Ore] 
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(3) (-r)a = r(—a) = —(ra) [as in (2) of (2.1.1) with a replaced by r and b by a] 


(4) If R is a field, or more generally a division ring, then rz = O,¢ implies that either 


r=0r orv=O0y. [If r 4 0, multiply the equation rz = 0, by r~+.] 


4.1.3 Examples 


bs 
2. 


If M is a vector space over the field R, then M is an R-module. 


Any ring R is a module over itself. Rather than check all the formal requirements, 
think intuitively: Elements of a ring can be added and subtracted, and we can 
certainly multiply r € R by x € R, and the usual rules of arithmetic apply. 


. If R is any ring, then R”, the set of all n-tuples with components in R, is an 


R-module, with the usual definitions of addition and scalar multiplication (as in 
Euclidean space, e.g., r(@1,.--,%n) = (rv1,...,1Xn), etc). 


. Let M = Myn(R) be the set of all m x n matrices with entries in R. Then M is 


an R-module, where addition is ordinary matrix addition, and multiplication of the 
scalar c by the matrix A means multiplication of each entry of A by c. 


. Every abelian group A is a Z-module. Addition and subtraction is carried out 


according to the group structure of A; the key point is that we can multiply « € A 
by the integer n. If n > 0, then na = x+a+---+a (n times); ifn < 0, then 
nx =—x£-—a“4—-:+-—2 (|n| times). 


In all of these examples, we can switch from left to right modules by a simple notational 
change. This is definitely not the case in the next example. 


6. 


Let I be a left ideal of the ring R; then J is a left R-module. (Ifa eI andre R 
then ra (but not necessarily zr) belongs to I.) Similarly, a right ideal is a right 
R-module, and a two-sided ideal is both a left and a right R-module. 


An R-module M permits addition of vectors and scalar multiplication. If multiplica- 
tion of vectors is allowed, we have an f-algebra. 


4.1.4 Definitions and Comments 


Let R be a commutative ring. We say that M is an algebra over R, or that M is an 
R-algebra, if M is an R-module that is also a ring (not necessarily commutative), and the 
ring and module operations are compatible, i.e., 


r(ay) = (ra)y = a(ry) for allz,ye€ M andre R. 


4.1.5 Examples 


a 


2. 


Every commutative ring R is an algebra over itself (see Example 2 of (4.1.3)). 


An arbitrary ring R is always a Z-algebra (see Example 5 of (4.1.3)). 
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3. If R is a commutative ring, then M,,(R), the set of all n x m matrices with entries 
in R, is an R-algebra (see Example 4 of (4.1.3)). 


4. If R is a commutative ring, then the polynomial ring R[X] is an R-algebra, as 
is the ring R[[X]] of formal power series; see Examples 5 and 6 of (2.1.3). The 
compatibility condition is satisfied because an element of R can be regarded as a 
polynomial of degree 0. 


5. If E/F is a field extension, then F is an algebra over F’. This continues to hold if E 
is a division ring, and in this case we say that FE is a division algebra over F. 


To check that a subset S of a vector space is a subspace, we verify that S is closed 
under addition of vectors and multiplication of a vector by a scalar. Exactly the same 
idea applies to modules and algebras. 


4.1.6 Definitions and Comments 


If N is a nonempty subset of the R-module M, we say that N is a submodule of M 
(notation N < M) if for every x,y € N and r,s € R, we have rx + sy € N. If M is an 
R-algebra, we say that N is a subalgebra if N is a submodule that is also a subring. 

For example, if A is an abelian group (= Zmodule), the submodules of A are the 
subsets closed under addition and multiplication by an integer (which amounts to addition 
also). Thus the submodules of A are simply the subgroups. If R is a ring, hence a 
module over itself, the submodules are those subsets closed under addition and also under 
multiplication by any r € R, in other words, the left ideals. (If we take R to be a right 
R-module, then the submodules are the right ideals.) 

We can produce many examples of subspaces of vector spaces by considering kernels 
and images of linear transformations. A similar idea applies to modules. 


4.1.7 Definitions and Comments 


Let M and N be R-modules. A module homomorphism (also called an R-homomorphism) 
from M to N isa map f: M— N such that 


f(rat+ sy) =rf(x)+ sf(y) for all z,y € M andr,s€ R. 


Equivalently, f(a + y) = f(x) + f(y) and f(rz) =rf(x) for all z,ye M andre R. 

The kernel of a homomorphism f is ker f = {a € M: f(x) = O}, and the image of f 
is {f(x): 2 € M}. 

If follows from the definitions that the kernel of f is a submodule of M, and the image 
of f is a submodule of N. 

If M and N are R-algebras, an algebra homomorphism or homomorphism of algebras 
from M to N is an R-module homomorphism that is also a ring homomorphism. 


4.1.8 Another Way to Describe an Algebra 


Assume that A is an algebra over the commutative ring R, and consider the map r > rl 
of R into A. The commutativity of R and the compatibility of the ring and module 
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operations imply that the map is a ring homomorphism. To see this, note that if r,s €¢ R 
then 


(rs)1 = (sr)1 = s(rl) = s[(r1)1] = (r1)(s1). 


Furthermore, if y € A then 


(r1)y = r(ly) = r(yl) = y(r1) 


so that r1 belongs to the center of A, i.e., the set of elements that commute with everything 
in A. 

Conversely, if f is a ring homomorphism from the commutative ring R to the center 
of the ring A, we can make A into an R-module via rx = f(r)x. The compatibility 
conditions are satisfied because 


r(ey) = f(r)(ey) = (F(r)@)y = (re)y 


and 


(F(r)e)y = (af(r))y = eF(r)y) = 2(ry). 


Because of this result, the definition of an R-algebra is sometimes given as follows. The 
ring A is an algebra over the commutative ring R if there exists a ring homomorphism of 
R into the center of A. For us at this stage, such a definition would be a severe overdose 
of abstraction. 

Notational Convention: We will often write the module {0} (and the ideal {0} in 
a ring) simply as 0. 


Problems For Section 4.1 
1. If J is an ideal of the ring R, show how to make the quotient ring R/J into a left 
R-module, and also show how to make R/TJ into a right R-module. 


2. Let A be a commutative ring and Fa field. Show that A is an algebra over F if and 
only if A contains (an isomorphic copy of) Fas a subring. 


Problems 3, 4 and 5 illustrate that familiar properties of vector spaces need not hold 
for modules. 


3. Give an example of an R-module M with nonzero elements r € R and x € M such 
that ra = 0. 


4. Let M be the additive group of rational numbers. Show that any two elements of M 
are linearly dependent (over the integers Z). 


5. Continuing Problem 4, show that M cannot have a basis, that is, a linearly independent 
spanning set over Z. 


6. Prove the modular law for subgroups of a given group G: With the group operation 
written multiplicatively, 


A(BNC) =(AB)NC 
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if A C C. Switching to additive notation, we have, for submodules of a given R-module, 
A+(BNC)=(A+B)NC, 


again if ACC. 
7. Let T be a linear transformation on the vector space V over the field Ff’. Show how to 
make V into an R-module in a natural way, where R is the polynomial ring F[X]. 


4.2 The Isomorphism Theorems For Modules 


If N is a submodule of the R-module M (notation N < M), then in particular N is an 
additive subgroup of M, and we may form the quotient group M/N in the usual way. 
In fact M/N becomes an R-module if we define r(a + N) = ra + N. (This makes sense 
because if x belongs to the submodule N, so does rx.) Since scalar multiplication in the 
quotient module M/N is carried out via scalar multiplication in the original module M, 
we can check the module axioms without difficulty. The canonical map 7: M — M/N is 
a module homomorphism with kernel N. Just as with groups and rings, we can establish 
the basic isomorphism theorems for modules. 


4.2.1 Factor Theorem For Modules 


Any module homomorphism f: M — M’ whose kernel contains N can be factored through 
M/N. In other words, there is a unique module homomorphism f: M/N — M' such that 


f(c+N) = f(x). Furthermore, (i) f is an epimorphism if and only if f is an epimorphism; 


(ii) f is a monomorphism if and only if ker f = N; (iii) f is an isomorphism if and only 
if f is a epimorphism and kerf = N. 

Proof. Exactly as in (2.3.1), with appropriate notational changes. (In Figure 2.3.1, replace 
Rby M, S by M’ andI by N.) & 

4.2.2 First Isomorphism Theorem For Modules 


If f: M — M’ is a module homomorphism with kernel N, then the image of f is isomor- 
phic to M/N. 


Proof. Apply the factor theorem, and note that f is an epimorphism onto its image. d& 


4.2.3. Second Isomorphism Theorem For Modules 


Let S and T be submodules of M, and let S+T={e+y:x2¢€S,yeT}. Then $+T 
and SMT are submodules of M and 


(S+T)/T&=S/(SNT). 


Proof. The module axioms for $+ T and SMT can be checked in routine fashion. Define 
amap f: S > M/T by f(a) =«+T. Then f is a module homomorphism whose kernel 
is ST and whose image is {cr +T: « € S} =(S+T)/T. The first isomorphism theorem 
for modules gives the desired result. & 
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4.2.4 Third Isomorphism Theorem For Modules 


If N<L <M, then 
M/L™ (M/N)/(L/N). 


Proof. Define f: M/N — M/L by f(a+N) =2+4+ L. As in (2.3.4), the kernel of f is 
{a+N:2€L}=L/N, and the image of f is {e+ L: « € M} = M/L. The result follows 
from the first isomorphism theorem for modules. & 


4.2.5 Correspondence Theorem For Modules 


Let N be a submodule of the R-module M. The map S — S/N sets up a one-to-one 
correspondence between the set of all submodules of M containing N and the set of all 
submodules of M/N. The inverse of the map is T > m~1(T), where 7 is the canonical 
map: M > M/N. 


Proof. The correspondence theorem for groups yields a one-to-one correspondence be- 
tween additive subgroups of M containing N and additive subgroups of M/N. We 
must check that submodules correspond to submodules, and it is sufficient to show 
that if S/N < S/N, then S$; < Sp» (the converse is immediate). If « € $4, then 
ct+NeES,/N C S/N, sox+N=y+N for some y € Sg. Thus «—y € N C So, and 
since y € Sg we must have x € S_ as well. The ore S$; < Sp. & 


We now look at modules that have a particularly simple structure, and can be used 
as building blocks for more complicated modules. 


4.2.6 Definitions and Comments 


An R-module M is cyclic if it is generated by a single element x. In other words, 
M =Rar={ra:re R}. 


Thus every element of M is a scalar multiple of x. (If « = 0, then M = {0}, which is 
called the zero module and is often written simply as 0.) A cyclic vector space over a field 
is a one-dimensional space, assuming that x 4 0. 

The annithilator of an element y in the R-module M is I, = {r € R: ry = 0}, a left 
ideal of R. If R is commutative, and M is cyclic with generator x, then M & R/I,. 
To see this, apply the first isomorphism theorem for modules to the map r > ra of R 
onto M. The annihilator of the module M is I, = {r € R: ry = 0 for every y € M}. 
Note that I, is a two-sided ideal, because if r € Ip and s € R, then for every y € M we 
have (rs)y = r(sy) = 0. When R is commutative, annihilating the generator of a cyclic 
module is equivalent to annihilating the entire module. 
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4.2.7 Lemma 


(a) If a generates a cyclic module M over the commutative ring R, then I, = I, so 
that M@ = R/I,. (In this situation, I, is frequently referred to as the order ideal of M.) 
(b) Two cyclic R-modules over a commutative ring are isomorphic if and only if they 
have the same annihilator. 


Proof. (a) If ra = 0 and y € M, then y = sz for some s € R, so ry = r(sx) = s(ra) 
s0 = 0. Conversely, if r annihilates M/, then in particular, ra = 0. 

(b) The “if” part follows from (a), so assume that g: Rx — Ry is an isomorphism 
of cyclic R-modules. Since g is an isomorphism, g(x) must be a generator of Ry, so we 
may as well take g(x) = y. Then g(ra) = rg(x) = ry, so rx corresponds to ry under the 
isomorphism. Therefore r belongs to the annihilator of Ra if and only if r belongs to the 
annihilator of Ry. & 


Problems For Section 4.2 


1. Show that every submodule of the quotient module M/N can be expressed as (L+N)/N 
for some submodule L of M. 


2. In Problem 1, must LZ contain N? 


3. In the matrix ring M,,(R), let M be the submodule generated by Fy, the matrix with 
1 in row 1, column 1, and 0’s elsewhere. Thus M = {AE},: A € M,,(R)}. Show that 
M consists of all matrices whose entries are zero except perhaps in column 1. 


4. Continuing Problem 3, show that the annihilator of £; consists of all matrices whose 
first column is zero, but the annihilator of WM is {0}. 


5. If I is an ideal of the ring R, show that R/T is a cyclic R-module. 


6. Let M be an R-module, and let J be an ideal of R. We wish to make M into an 
R/I-module via (r + I)m =rm,r € R,m € M. When will this be legal? 


7. Assuming legality in Problem 6, let M4, be the resulting R/I-module, and note that as 
sets, M, = M. Let N bea subset of M and consider the following two statements: 
(a) N is an R-submodule of M; 
(b) N is an R/I-submodule of Mj. 


Can one of these statements be true and the other false? 


4.3. Direct Sums and Free Modules 


4.3.1 Direct Products 


In Section 1.5, we studied direct products of groups, and the basic idea seems to carry 
over to modules. Suppose that we have an R-module M; for each 7 in some index set I 
(possibly infinite). The members of the direct product of the M;, denoted by [],-, Mi, are 
all families (a;,i € I), where a; € M;. (A family is just a function on I whose value at the 
element 7 is a;.) Addition is described by (a;) + (6;) = (a; + 6;) and scalar multiplication 
by r(a;) = (raj). 
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There is nothing wrong with this definition, but the resulting mathematical object has 
some properties that are undesirable. If (e;) is the family with 1 in position i and zeros 
elsewhere, then (thinking about vector spaces) it would be useful to express an arbitrary 
element of the direct product in terms of the e;. But if the index set I is infinite, we 
will need concepts of limit and convergence, and this will take us out of algebra and into 
analysis. Another approach is to modify the definition of direct product. 


4.3.2 Definitions 


The external direct sum of the modules M;,i € I, denoted by @je7M;j, consists of all 
families (a;,i € I) with a; € M;, such that a; = 0 for all but finitely many i. Addition 
and scalar multiplication are defined exactly as for the direct product, so that the external 
direct sum coincides with the direct product when the index set J is finite. 

The R-module M is the internal direct sum of the submodules M; if each x € M can 
be expressed uniquely as 7;, +---+2;, where0 A a;, € Mi,,k =1,...,n. (The positive 
integer n and the elements x;, depend on x. In any expression of this type, the indices 
iz are assumed distinct.) 

Just as with groups, the internal and external direct sums are isomorphic. To see this 
without a lot of formalism, let the element x;, € M!;, correspond to the family that has 
x;, in position 7, and zeros elsewhere. We will follow standard practice and refer to the 
“direct sum” without the qualifying adjective. Again as with groups, the next result may 
help us to recognize when a module can be expressed as a direct sum. 


4.3.3. Proposition 


The module M is the direct sum of submodules M; if and only if both of the following 
conditions are satisfied: 


(1) M=)0, Mi, that is, each x € M is a finite sum of the form a;, + --- + 2,,, where 
Li, E Mi,3 


(2) For each i, MiN 0,4; Mj = 0. 


(Note that in condition (1), we do not assume that the representation is unique. Observe 
also that another way of expressing (2) is that if v;,+---+a;, =0, with z;, € M;,, then 
x4, = 0 for all k.) 


Proof. The necessity of the conditions follows from the definition of external direct sum, 
so assume that (1) and (2) hold. If « € M then by (1), a is a finite sum of elements 
from various M;’s. For convenience in notation, say v = 41+ %2+ 234+ 2&4 with a; € Mi, 
z= 1,2,3,4. If the representation is not unique, say x = y; + yo + ya + y5 + yo with 
y, € M;, 1 = 1,2,4,5,6. Then x3 is a sum of terms from modules other than M3, so 
by (2), 23 = 0. Similarly, ys = ye = 0 and we have 71 +72 +24 = yi tyot ys. But 
then x1 — y; is a sum of terms from modules other than M1, so by (2), 21 = yi. Similarly 
t= y2, T4 = y4, and the result follows. & 
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A basic property of the direct sum M = @;¢;M; is that homomorphisms f;: M; > N 
can be “lifted” to M. In other words, there is a unique homomorphism f: M — N such 
that for each i, f = f; on M;. Explicitly, 


Pe Be) Fale Soh ee fe as) 


[No other choice is possible for f, and since each f; is a homomorphism, so is f.] 
We know that every vector space has a basis, but not every module is so fortunate; 
see Section 4.1, Problem 5. We now examine modules that have this feature. 


4.3.4 Definitions and Comments 


Let S' be a subset of the R-module M. We say that S is linearly independent over R if 
Ait +:+++Ag2~ = 0 implies that all \; = 0 (A; © Rix; € Sk =1,2,...). We say that S 
is a spanning (or generating) set for M over R, or that S spans (generates) M over R if 
each x € M can be written as a finite linear combination of elements of S with coefficients 
in R. We will usually omit “over R” if the underlying ring R is clearly identified. A basis 
is a linearly independent spanning set, and a module that has a basis is said to be free. 

Suppose that M is a free module with basis (b;,i € I), and we look at the submodule 
M; spanned by the basis element b;. (In general, the submodule spanned (or generated) by 
a subset T of M consists of all finite linear combinations of elements of T with coefficients 
in R. Thus the submodule spanned by 8; is the set of all rb;,r € R.) If R is regarded as 
a module over itself, then the map r — rb; is an R-module isomorphism of R and Mj, 
because {b;} is a linearly independent set. Since the b; span M, it follows that M is the 
sum of the submodules M;, and by linear independence of the b;, the sum is direct. Thus 
we have an illuminating interpretation of a free module: 


A free module is a direct sum of isomorphic copies of the underlying ring R. 


Conversely, a direct sum of copies of R is a free R-module. If e; has 1 as its i“” component 
and zeros elsewhere, the e; form a basis. 
This characterization allows us to recognize several examples of free modules. 


1. For any positive integer n, R” is a free R-module. 


2. The matrix ring Mmn(R) is a free R-module with basis E;;, i = 1,...,m, 
PSL eM 


3. The polynomial ring R[X] is a free R-module with basis 1, X, X?,.... 
We will adopt the standard convention that the zero module is free with the empty set 
as basis. 

Any two bases for a vector space over a field have the same cardinality. This property 
does not hold for arbitrary free modules, but the following result covers quite a few cases. 


4.3.5 Theorem 


Any two bases for a free module M over a commutative ring R have the same cardinality. 
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Proof. If I is a maximal ideal of R, then k = R/T is a field, and V = M/IM is a vector 
space over k. [By IM we mean all finite sums 5+ a,x; with a; € I and x; € M; thus IM 
is a submodule of M. Ifr+I¢kandx+IM € M/IM, we take (r+ I)(x+IM) to be 
rx+IM. The scalar multiplication is well-defined because r € I or x € IM implies that 
ra € IM. We can express this in a slightly different manner by saying that J annihilates 
M/IM. The requirements for a vector space can be checked routinely. ] 

Now if (a;) isa basis for M, let ; = «;+1M. Since the x; span M, the Z; span M/IM. 
If > a,x; = 0, where @; = a, + _I,a; € R, then > ajx; € IM. Thus )> a;x; = 90 b,x; with 
b; € I. Since the x; form a basis, we must have a; = 6; for some 7. Consequently a; € J, 
so that @ = 0 in k. We conclude that the %; form a basis for V over k, and since the 
dimension of V over k depends only on M, R and J, and not on a particular basis for M, 
the result follows. d& 


4.3.6 Some Key Properties of Free Modules 


Suppose that M is a free module with basis (2;), and we wish to construct a module 
homomorphism f from M to an arbitrary module N. Just as with vector spaces, we can 
specify f(a;) = y; € N arbitrarily on basis elements, and extend by linearity. Thus if 
x = daz; € M, we have f(x) = > aiy;. (The idea should be familiar; for example, 
a linear transformation on Euclidean 3-space is determined by what it does to the three 
standard basis vectors.) Now let’s turn this process around: 


If N is an arbitrary module, we can express N as a homomorphic image of a 
free module. 


All we need is a set (y;,i € I) of generators for N. (If all else fails, we can take the y; to 
be all the elements of N.) We then construct a free module with basis (x;,7 € I). (To do 
this, take the direct sum of copies of R, as many copies as there are elements of J.) Then 
map 2x; to y; for each 2. 

Note that by the first isomorphism theorem, every module is a quotient of a free 
module. 


Problems For Section 4.3 

1. Show that in Proposition 4.3.3, (2) can be replaced by the weaker condition that for 
each i, Mi >) ,<; Mj =0. (Assume a fixed total ordering on the index set.) 

2. Let A be a finite abelian group. Is it possible for A to be a free Z-module? 


3. Let r and s be elements in the ideal J of the commutative ring R. Show that r and s 
are linearly dependent over R. 

4. In Problem 3, regard IJ as an R-module. Can I be free? 

5. Give an example of an infinite abelian group that is a free Z-module, and an example 
of an infinite abelian group that is not free. 


6. Show that a module M is free if and only if M has a subset S such that any function 
f from S to a module N can be extended uniquely to a module homomorphism from 
M to N. 
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7. Let M be a free module, expressed as the direct sum of a@ copies of the underlying ring 
R, where a and |R| are infinite cardinals. Find the cardinality of M. 


8. In Problem 7, assume that all bases B have the same cardinality, e.g., R is commutative. 
Find the cardinality of B. 


4.4 Homomorphisms and Matrices 


Suppose that M is a free R-module with a finite basis of n elements v1,..., Un, Sometimes 
called a free module of rank n. We know from Section 4.3 that M is isomorphic to the 
direct sum of n copies of R. Thus we can regard M as R”, the set of all n-tuples with 
components in R. Addition and scalar multiplication are performed componentwise, as 
in (4.1.3), Example 3. Note also that the direct sum coincides with the direct product, 
since we are summing only finitely many modules. 

Let N be a free R-module of rank m, with basis w,,...,wm, and suppose that f is a 
module homomorphism from M to N. Just as in the familiar case of a linear transforma- 
tion on a finite-dimensional vector space, we are going to represent f by a matrix. For 
each j, f(v;) is a linear combination of the basis elements w;, so that 


f(vj) = > aywi, j= 1,...47 (1) 
i=l 


where the a;; belong to R. 

It is natural to associate the m x n matrix A with the homomorphism f, and it 
appears that we have an isomorphism of some sort, but an isomorphism of what? If f 
and g are homomorphisms of M into N, then f and g can be added (and subtracted): 
(f + g)(x) = f(x) + g(a). If f is represented by the matrix A and g by B, then f +g 
corresponds to A+ B. This gives us an abelian group isomorphism of Homr(M, N), the 
set of all R-module homomorphisms from M to N, and Minn(R), the set of all m x n 
matrices with entries in R. In addition, Mny(R) is an R-module, so it is tempting to 
say “obviously, we have an R-module isomorphism”. But we must be very careful here. 
If f € Homr(M,N) and s € R, we can define sf in the natural way: (sf)(x) = sf(x). 
However, if we carry out the “routine” check that sf € Homr(M, N), there is one step 
that causes alarm bells to go off: 


(sf)(rx) = sf(r2) = srf(x), but r(sf)(@) = rsf(x) 


and the two expressions can disagree if R is not commutative. Thus Homr(M, N) need 
not be an R-module. Let us summarize what we have so far. 


4.4.1 The Correspondence Between Homomorphisms 
and Matrices 


Associate with each f € Homr(M, N) a matrix A as in (1) above. This yields an abelian 
group isomorphism, and also an R-module isomorphism if R is commutative. 

Now let m = n, so that the dimensions are equal and the matrices are square, and 
take v; = w; for all i. A homomorphism from M to itself is called an endomorphism of 
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M, and we use the notation Endr(M) for Homr(M, M). Since Endr(M) is a ring under 
composition of functions, and M,,(R) is a ring under matrix multiplication, it is plausible 
to conjecture that we have a ring isomorphism. If f corresponds to A and g to B, then 
we can apply g to both sides of (1) to obtain 

n n n n 

GF (v5) = S Qi; s, beiVE = Oe ij Dki)UR- (2) 

i=l k= 1 k=1 i=l 
If R is commutative, then a;j;by; = bysajj, and the matrix corresponding to gf = go f is 
BA, as we had hoped. In the noncommutative case, we will not be left empty-handed if 
we define the opposite ring R°, which has exactly the same elements as R and the same 
addition structure. However, multiplication is done backwards, i.e., ab in R° is ba in R. 
It is convenient to attach a superscript ° to the elements of R°, so that 


a°b° = ba (more precisely, a°b° = (ba)°). 


Thus in (2) we have a;;bei = by,a?;. To summarize, 


The endomorphism ring Endr(M) is isomorphic to the ring of n x n matrices 
with coefficients in the opposite ring R°. If R is commutative, then Endr(M) 
is ring-isomorphic to M,,(R). 


4.4.2. Preparation For The Smith Normal Form 


We now set up some basic machinery to be used in connection with the Smith normal form 
and its applications. Assume that M is a free Z-module of rank n, with basis 71,...,2n, 
and that K is a submodule of M with finitely many generators u1,...,Um. (We say that 
K is finitely generated.) We change to a new basis y1,...,Yn via Y = PX, where X 
[resp. Y] is a column vector with components x; |resp. y;]. Since X and Y are bases, the 
n Xn matrix P must be invertible, and we need to be very clear on what this means. 
If the determinant of P is nonzero, we can construct P~', for example by the “adjoint 
divided by determinant” formula given in Cramer’s rule. But the underlying ring is Z, 
not Q, so we require that the coefficients of P~! be integers. (For a more transparent 
equivalent condition, see Problem 1.) Similarly, we are going to change generators of K 
via V = QU, where Q is an invertible m x m matrix and U is a column vector with 
components U;. 

The generators of K are linear combinations of basis elements, so we have an equation 
of the form U = AX, where A is an m x n matrix called the relations matrix. Thus 


V =QU =QAX = QAP'’Y. 


so the new relations matrix is 
B=QAP"!. 


Thus B is obtained from A by pre-and postmultiplying by invertible matrices, and we 
say that A and B are equivalent. We will see that two matrices are equivalent iff they 
have the same Smith normal form. The point we wish to emphasize now is that if we 
know the matrix P, we can compute the new basis Y, and if we know the matrix Q, 
we can compute the new system of generators V. In our applications, P and Q will be 
constructed by elementary row and column operations. 
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Problems For Section 4.4 


1. Show that a square matrix P over the integers has an inverse with integer entries if 
and only if P is unimodular, that is, the determinant of P is +1. 

2. Let V be the direct sum of the R-modules Vj,...,V,, and let W be the direct sum of 
R-modules Wj,..., Wm. Indicate how a module homomorphism from V to W can be 
represented by a matrix. (The entries of the matrix need not be elements of R.) 


3. Continuing Problem 2, show that if V” is the direct sum of n copies of the R-module 
V, then we have a ring isomorphism 


Endr(V") = M,,(Endr(V)). 


4. Show that if R is regarded as an R-module, then Endr(R) is isomorphic to the opposite 
ring R°. 

5. Let R be a ring, and let f € Endr(R). Show that for some r € R we have f(x) = ar 
for alla € R. 


6. Let M be a free R-module of rank n. Show that Endry(M) = M,,(R°), a ring isomor- 
phism. 

7. Continuing Problem 6, if R is commutative, show that the ring isomorphism is in fact 
an R-algebra isomorphism. 


4.5 Smith Normal Form 


We are going to describe a procedure that is very similar to reduction of a matrix to 
echelon form. The result is that every matrix over a principal ideal domain is equivalent 
to a matrix in Smith normal form. Explicitly, the Smith matrix has nonzero entries only 
on the main diagonal. The main diagonal entries are, from the top, a,,...,a, (possibly 
followed by zeros), where the a; are nonzero and a; divides a;+, for all é. 

We will try to convey the basic ideas via a numerical example. This will allow us to 
give informal but convincing proofs of some major theorems. A formal development is 
given in Jacobson, Basic Algebra I, Chapter 3. All our computations will be in the ring 
of integers, but we will indicate how the results can be extended to an arbitrary principal 
ideal domain. Let’s start with the following matrix: 


0 0 22 0 
—2 2 -6 -4 
2 2 6 8 


As in (4.4.2), we assume a free Z-module M with basis 1, 72, 73,4, and a submodule K 
generated by uy, U2, ug, where uy = 2243, ug = —241 + 2x2 — 6x3 — 404, U3 = 24, + 242 + 
6%3+824. The first step is to bring the smallest positive integer to the 1-1 position. Thus 
interchange rows 1 and 3 to obtain 


2 2 6 8 
—2 2 -6 -4 
0 0 22 0 
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Since all entries in column 1, and similarly in row 1, are divisible by 2, we can pivot about 
the 1-1 position, in other words, use the 1-1 entry to produce zeros. Thus add row 1 to 
row 2 to get 


6 
0 
22 


Add —1 times column 1 to column 2, then add —3 times column 1 to column 3, and add 
—4 times column 1 to column 4. The result is 


2 0 0 O 
0 4 0 4 
0 0 22 0 


Now we have “peeled off” the first row and column, and we bring the smallest positive 
integer to the 2-2 position. It’s already there, so no action is required. Furthermore, the 
2-2 element is a multiple of the 1-1 element, so again no action is required. Pivoting about 
the 2-2 position, we add —1 times column 2 to column 4, and we have 

2 0 0 0 

0 4 0 0 

0 0 22 0 
Now we have peeled off the first two rows and columns, and we bring the smallest positive 
integer to the 3-3 position; again it’s already there. But 22 is not a multiple of 4, so we 
have more work to do. Add row 3 to row 2 to get 


2 0 0 0 
0 4 22 0 
0 0 22 0 


Again we pivot about the 2-2 position; 4 does not divide 22, but if we add —5 times 
column 2 to column 38, we have 


20 0 O 

04 2 O 

0 0 22 0 
Interchange columns 2 and 3 to get 

2 0 0 0 

0 2 4 0 

0 22 0 0 
Add —11 times row 2 to row 3 to obtain 

20 0 O 

02 4 O 

0 0 —44 0 
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Finally, add —2 times column 2 to column 3, and then (as a convenience to get rid of the 
minus sign) multiply row (or column) 3 by —1; the result is 


2 0 0 O 
0 2 0 0 
0 0 44 0 


which is the Smith normal form of the original matrix. Although we had to backtrack to 
produce a new pivot element in the 2-2 position, the new element is smaller than the old 
one (since it is a remainder after division by the original number). Thus we cannot go 
into an infinite loop, and the algorithm will indeed terminate in a finite number of steps. 
In view of (4.4.2), we have the following interpretation. 

We have a new basis yj, y2, y3, y4 for M, and new generators v1, v2,v3 for K, where 
v1 = 2y1, v2 = 2y2, and v3 = 44y3. In fact since the v;’s are nonzero multiples of the 
corresponding y;’s, they are linearly independent, and consequently form a basis of K. 
The new basis and set of generators can be expressed in terms of the original sets; see 
Problems 1-3 for the technique. 

The above discussion indicates that the Euclidean algorithm guarantees that the Smith 
normal form can be computed in finitely many steps. Therefore the Smith procedure can 
be carried out in any Euclidean domain. In fact we can generalize to a principal ideal 
domain. Suppose that at a particular stage of the computation, the element a occupies 
the 1-1 position of the Smith matrix S,, and the element 6 is in row 1, column 2. To use a 
as a pivot to eliminate b, let d be the greatest common divisor of a and b, and let r and s 
be elements of R such that ar + bs = d (see (2.7.2)). We postmultiply the Smith matrix 
by a matrix T of the following form (to aid in the visualization, we give a concrete 5 x 5 
example): 


r b/d 0 0 0 
s —a/d 0 0 0 
0 0 1 0 0 
0 0 0 1 0 
0 0 0 0 1 


The 2 x 2 matrix in the upper left hand corner has determinant —1, and is therefore 
invertible over R. The element in the 1-1 position of ST is ar + bs = d, and the element 
in the 1-2 position is ab/d — ba/d = 0, as desired. We have replaced the pivot element a 
by a divisor d, and this will decrease the number of prime factors, guaranteeing the finite 
termination of the algorithm. Similarly, if b were in the 2-1 position, we would premultiply 
S by the transpose of T; thus in the upper left hand corner we would have 


bee ae 


Problems For Section 4.5 
1. Let A be the matrix 
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over the integers. Find the Smith normal form of A. (It is convenient to begin by 
adding column 2 to column 1.) 


2. Continuing Problem 1, find the matrices P and Q, and verify that QAP7! is the Smith 
normal form. 


3. Continuing Problem 2, if the original basis for M is {21, 22,23} and the original set of 
generators of K is {u,,u2,u3}, find the new basis and set of generators. 


It is intuitively reasonable, but a bit messy to prove, that if a matrix A over a PID 
is multiplied by an invertible matrix, then the greatest common divisor of all the z x 7 
minors of A is unchanged. Accept this fact in doing Problems 4 and 5. 


4. The nonzero components a; of the Smith normal form S of A are called the invariant 
factors of A. Show that the invariant factors of A are unique (up to associates). 


5. Show that two m xn matrices are equivalent if and only if they have the same invariant 
factors, i.e. (by Problem 4), if and only if they have the same Smith normal form. 


6. Recall that when a matrix over a field is reduced to row-echelon form (only row opera- 
tions are involved), a pivot column is followed by non-pivot columns whose entries are 
zero in all rows below the pivot element. When a similar computation is carried out 
over the integers, or more generally over a Euclidean domain, the resulting matrix is 
said to be in Hermite normal form. We indicate the procedure in a typical example. 
Let 


6 4 13 5 
A=|9 6 0 7 
12 8 -1 12 


Carry out the following sequence of steps: 


. Add —1 times row 1 to row 2 

. Interchange rows 1 and 2 

. Add —2 times row 1 to row 2, and then add —4 times row 1 to row 3 
. Add —1 times row 2 to row 3 

. Interchange rows 2 and 3 

. Add —3 times row 2 to row 3 

. Interchange rows 2 and 3 

. Add —4 times row 2 to row 3 


. Add 5 times row 2 to row 1 (this corresponds to choosing 0,1,...,m—1 asa 
complete system of residues mod m) 


10. Add 2 times row 3 to row 1, and then add row 3 to row 2 


O© AN OTK Wn er 


We now have reduced A to Hermite normal form. 


7. Continuing Problem 6, consider the simultaneous equations 


6a + 4y +132 =5, 9x + By =7, 12” + By — z = 12 (mod m) 


For which values of m > 2 will the equations be consistent? 


4.6. FUNDAMENTAL STRUCTURE THEOREMS 17 


4.6 Fundamental Structure Theorems 


The Smith normal form yields a wealth of information about modules over a principal 
ideal domain. In particular, we will be able to see exactly what finitely generated abelian 
groups must look like. 

Before we proceed, we must mention a result that we will use now but not prove until 
later (see (7.5.5), Example 1, and (7.5.9)). If M is a finitely generated module over a PID 
R, then every submodule of M is finitely generated. [R is a Noetherian ring, hence M is 
a Noetherian R-module.| To avoid gaps in the current presentation, we can restrict our 
attention to finitely generated submodules. 


4.6.1 Simultaneous Basis Theorem 


Let M be a free module of finite rank n > 1 over the PID R, and let K be a submodule 
of M. Then there is a basis {yi,..., Yn} for M and nonzero elements a1,...,a; € R such 
that r <n, a; divides a;+, for all 7, and {a1yi,...,@,y,} is a basis for Kk. 


Proof. This is a corollary of the construction of the Smith normal form, as explained in 
Section 4.5. d& 


4.6.2 Corollary 


Let M be a free module of finite rank n over the PID R. Then every submodule of M is 
free of rank at most n. 


Proof. By (4.6.1), the submodule K has a basis with r <n elements. & 


In (4.6.2), the hypothesis that M has finite rank can be dropped, as the following 
sketch suggests. We can well-order the generators ug of K, and assume as a transfinite 
induction hypothesis that for all 8 < a, the submodule Kg spanned by all the generators 
up to ug is free of rank at most that of M, and that if y < @, then the basis of Ky is 
contained in the basis of Kg. The union of the bases Sig of the Kg is a basis Sq for Kg. 
Furthermore, the inductive step preserves the bound on rank. This is because |.Sg| < rank 
M for all 6 < a, and |Sq| is the smallest cardinal bounded below by all |.Sg|, 6 < a. Thus 
|So| < rank M. 


4.6.3. Fundamental Decomposition Theorem 


Let M be a finitely generated module over the PID R. Then there are ideals I; = (ay), 
Ip = (a),...,In = (Qn) of R such that I, D Ig D--- D I, (equivalently, a1 | a2 |---| an) 
and 


M®R/I,®@ R/In®---@R/In. 


Thus M is a direct sum of cyclic modules. 
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Proof. By (4.3.6), M is the image of a free module R” under a homomorphism f. If K 
is the kernel of f, then by (4.6.1) we have a basis y1,...,Yn for R” and a corresponding 
basis a1y1,..-,@rY,r for K. We set a; = 0 for r<i<n. Then 


Ry, 8: ++ 8 Ryn 
Rayy1 eens Ran 


i=l 


ME RYKS 


(To justify the last step, apply the first isomorphism theorem to the map 
riyi tee trnyn 2 (riys + Raryi,.--,%nYn + Ranyn-) 
But 
Ry;/Raiy; = R/ Raj, 


as can be seen via an application of the first isomorphism theorem to the map 
rary, + Rajyy;. Thus if I; = Ra;,i=1,...,n, we have 


M2 a R/I; 
i=1 


and the result follows. & 


Remark It is plausible, and can be proved formally, that the uniqueness of invariant 
factors in the Smith normal form implies the uniqueness of the decomposition (4.6.3). 
Intuitively, the decomposition is completely specified by the sequence aj,...,dan, as the 
proof of (4.6.3) indicates. 


4.6.4 Finite Abelian Groups 


Suppose that G is a finite abelian group of order 1350; what can we say about G? In 
the decomposition theorem (4.6.3), the components of G are of the form Z/Za;, that is, 
cyclic groups of order a;. We must have a; | a;41 for all i, and since the order of a direct 
sum is the product of the orders of the components, we have a, ---a, = 1350. 

The first step in the analysis is to find the prime factorization of 1350, which is 
(2)(33) (57). One possible choice of the a; is a; = 3, a2 = 3, a3 = 150. It is convenient to 
display the prime factors of the a;, which are called elementary divisors, as follows: 


ay = 3 = 293159 
ag = 3 = 29315° 
a3 = 150 = 21315? 


Since a,a2a3 = 2'335?, the sum of the exponents of 2 must be 1, the sum of the exponents 
of 3 must be 3, and the sum of the exponents of 5 must be 2. A particular distribution 
of exponents of a prime p corresponds to a partition of the sum of the exponents. For 
example, if the exponents of p were 0, 1, 1 and 2, this would correspond to a partition 
of 4as 1+1+2. In the above example, the partitions are 1 = 1,3 =1+1+1,2=2. We 
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can count the number of abelian groups of order 1350 (up to isomorphism) by counting 
partitions. There is only one partition of 1, there are two partitions of 2 (2 and 1+ 1) 
and three partitions of 3 (3, 1+2 and1+1+1). [This pattern does not continue; there 
are five partitions of 4, namely 4, 1+3,1+1+2,1+1+1+1, 242, and seven partitions 
of 5, namely 5, 1+4,14+14+3,14+14142,141414141,142+2, 243.) We specify 
a group by choosing a partition of 1, a partition of 3 and a partition of 2, and the number 
of possible choices is (1)(3)(2) = 6. Each choice of a sequence of partitions produces a 
different sequence of invariant factors. Here is the entire list; the above example appears 
as entry (5). 


(1) a; = 21395? = 1350, G & Zi350 

(2) a, = 29395! = 5, ay = 213351 = 270, G & Zs @ Zor 

(3) a, = 293159 = 3, ay = 21325? = 450, G & Zs © Zaso 

(4) a; = 29315! = 15, ay = 213251 = 90, G & Zi5 O Zoo 

(5) a, = 293159 = 3, ay = 293159 = 3, ag = 21315? = 150, G & Zz @ Z3 @ Ziso 
(6) a, = 293159 = 3, ay = 29315) = 15, ag = 213151 = 30, G & Zs @ Zig © Zo0. 


In entry (6) for example, the maximum number of summands in a partition is 3 (= 1+ 
1+1), and this reveals that there will be three invariant factors. The partition 2=1+1 
has only two summands, and it is “pushed to the right” so that 5! appears in a2 and 
az but not a;. (Remember that we must have a, | a2 | a3.). Also, we can continue to 
decompose some of the components in the direct sum representation of G. (If m and n 
are relatively prime, then Zmn =~ Zm ® Zp by the Chinese remainder theorem.) However, 
this does not change the conclusion that there are only 6 mutually nonisomorphic abelian 
groups of order 1350. 

Before examining infinite abelian groups, let’s come back to the fundamental decom- 
position theorem. 


4.6.5 Definitions and Comments 


If x belongs to the R-module M, where R is any integral domain, then x is a torsion 
element if rx = 0 for some nonzero r € R. The torsion submodule T of M is the set of 
torsion elements. (T' is indeed a submodule; if ra = 0 and sy = 0, then rs(x+y) =0.) M 
is a torsion module if T is all of M, and M is torsion-free if T consists of 0 alone, in other 
words, rx = 0 implies that either r = 0 or x = 0. A free module must be torsion-free, 
by definition of linear independence. Now assume that R is a PID, and decompose M as 
in (4.6.3), where aj,...,a@, are nonzero and a,,, =-+-: = @) = 0. Each module R/(aj), 
1 <i<r, is torsion (it is annihilated by a;), and the R/(a;), r+ 1 <i <n, are copies 
of R. Thus 6/_,.,,R/(a;) is free. We conclude that 


(*) every finitely generated module over a PID is the direct sum of its torsion submodule 
and a free module 


and 
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(**) every finitely generated torsion-free module over a PID is free. 
In particular, a finitely generated abelian group is the direct sum of a number (possibl 
p , ys group p y 


zero) of finite cyclic groups and a free abelian group (possibly {0}). 


4.6.6 Abelian Groups Specified by Generators and Relations 


Suppose that we have a free abelian group F' with basis 71,272,273, and we impose the 
following constraints on the x;: 


224 + 2x2 + 823 => 0, —271 + 2x2 + 4x3 => 0. (1) 
What we are doing is forming a “submodule of relations” K with generators 
uy = 2%, +2%24+ 823 and ug = —22%,+2r9 +423 (2) 


and we are identifying every element in K with zero. This process yields the abelian group 
G = F/K, which is generated by 21 + K, x2 + K and 23+ K. The matrix associated 


with (2) is 
2 2 8 
—2 2 4 
and a brief computation gives the Smith normal form 
2 0 0 
0 4 O]° 


Thus we have a new basis y1, y2, y3 for F and new generators 2y1, 4y2 for K. The quotient 
group F’/K is generated by yi + K,yo+K and y3+K, with 2(y1+K) = 4(yo+K) =0+K. 
In view of (4.6.3) and (4.6.5), we must have 


F/K2=2Z,02Z,0Z. 


Canonical forms of a square matrix A can be developed by reducing the matrix xl — A 
to Smith normal form. In this case, R is the polynomial ring F'[X] where F is a field. 
But the analysis is quite lengthy, and I prefer an approach in which the Jordan canonical 
form is introduced at the very beginning, and then used to prove some basic results in 
the theory of linear operators; see Ash, A Primer of Abstract Mathematics, MAA 1998. 


Problems For Section 4.6 


1. Classify all abelian groups of order 441. 
2. Classify all abelian groups of order 40. 


3. Identify the abelian group given by generators x1, 72,23 and relations 


r+ 522 +323 = 0, 2x41 —%2+ 7x3 = 0,7 321 + 4x9 + 273 = 0. 
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4. In (4.6.6), suppose we cancel a factor of 2 in Equation (1). This changes the matrix 
associated with (2) to 
114 
-1 1 2] 


whose Smith normal form differs from that given in the text. What’s wrong? 
5. Let M, N and P be abelian groups. If M@ N = M @ P, show by example that N 
need not be isomorphic to P. 


6. In Problem 5, show that M6 N ~ M @ P does imply N = P if M, N and P are 
finitely generated. 


4.7 Exact Sequences and Diagram Chasing 


4.7.1 Definitions and Comments 


Suppose that the R-module M is the direct sum of the submodules A and B. Let f be 
the inclusion or injection map of A into M (simply the identity function on A), and let 
g be the natural projection of M on B, given by g(a+ 0) = b,a€ A, b€ B. The image 
of f, namely A, coincides with the kernel of g, and we say that the sequence 


Ape (1) 


is exact at M. A longer (possibly infinite) sequence of homomorphisms is said to be exact 
if it is exact at each junction, that is, everywhere except at the left and right endpoints, 
if they exist. 

There is a natural exact sequence associated with any module homomorphism 
g: M — N, namely 


(Ss eS pe (2) 


In the diagram, A is the kernel of g, f is the injection map, and B is the image of g. A 
five term exact sequence with zero modules at the ends, as in (2), is called a short exact 
sequence. Notice that exactness at A is equivalent to ker f = 0, i.e., injectivity of f. 
Exactness at B is equivalent to im g = B, i.e., surjectivity of g. Notice also that by the 
first isomorphism theorem, we may replace B by M/A and g by the canonical map of M 
onto M/A, while preserving exactness. 

Now let’s come back to (1), where M is the direct sum of A and B, and attach zero 
modules to produce the short exact sequence (2). If we define h as the injection of B into 
M and ¢ as the projection of M on A, we have (see (3) below) goh=1andeof =1, 
where 1 stands for the identity map. 
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The short exact sequence (2) is said to split on the right if there is a homomorphism 
h: B— M such that goh = 1, and split on the left if there is a homomorphism e: M — A 
such that eo f = 1. These conditions turn out to be equivalent, and both are equivalent 
to the statement that M is essentially the direct sum of A and B. “Essentially” means 
that not only is M isomorphic to A @® B, but f can be identified with the injection of A 
into the direct sum, and g with the projection of the direct sum on B. We will see how 
to make this statement precise, but first we must turn to diagram chasing, which is a 
technique for proving assertions about commutative diagrams by sliding from one vertex 
to another. The best way to get accustomed to the method is to do examples. We will 
work one out in great detail in the text, and there will be more practice in the exercises, 
with solutions provided. 

We will use the shorthand gf for go f and fm for f(m). 


4.7.2 The Five Lemma 


Consider the following commutative diagram with exact rows. 


If s,t,v and w are isomorphisms, so is u. (In fact, the hypotheses on s and w can be 
weakened to s surjective and w injective.) 


Proof. The two parts of the proof are of interest in themselves, and are frequently called 
the “four lemma”, since they apply to diagrams with four rather than five modules in 
each row. 


(i) If ¢ and v are surjective and w is injective, then wu is surjective. 
(ii) If s is surjective and ¢ and v are injective, then wu is injective. 


[The pattern suggests a “duality” between injective and surjective maps. This idea will be 
explored in Chapter 10; see (10.1.4).] The five lemma follows from (i) and (ii). To prove (i), 
let m’ € M’. Then g’m’ € B’, and since v is surjective, we can write g’m’ = vb for some 
be B. By commutativity of the square on the right, h’ub = whb. But h’vb = h'g’m’ = 0 
by exactness of the bottom row at B’, and we then have whb = 0. Thus hb € ker w, and 
since w is injective, we have hb = 0, so that b € kerh = img by exactness of the top row 
at B. So we can write b = gm for some m € M. Now g'm’ = vb (see above) = vgm = g'um 
by commutativity of the square MBB'M'. Therefore m’ — um € kerg’ = imf’ by 
exactness of the bottom row at M’. Let m’ — um = f'a’ for some a’ € A’. Since t 
is surjective, a’ = ta for some a € A, and by commutativity of the square AMM’ A’, 
f'ta = ufa, so m’ — um = ufa, so m’ = u(m + fa). Consequently, m’ belongs to the 
image of u, proving that wu is surjective. 

To prove (ii), suppose m € keru. By commutativity, g’wum = vgm, so vgm = 0. 
Since v is injective, gm = 0. Thus m € kerg = im /f by exactness, say m = fa. Then 
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0 = um = ufa = f’ta by commutativity. Thus ta € ker f’ = ime’ by exactness. If 
ta = e’d’, then since s is surjective, we can write d’ = sd, so ta = e’sd. By commutativity, 
e'sd = ted, so ta = ted. By injectivity of t, a = ed. Therefore m = fa = fed = 0 by 
exactness. We conclude that u is injective. d& 


4.7.3. Corollary: The Short Five Lemma 


Consider the following commutative diagram with exact rows. (Throughout this section, 
all maps in commutative diagrams and exact sequences are assumed to be R-module 
homomorphisms. ) 


If t and v are isomorphisms, so is wu. 


Proof. Apply the five lemma with C = D = C’ = D' = 0, and s and w the identity 
maps. @& 


We can now deal with splitting of short exact sequences. 


4.7.4 Proposition 
Let 


ee ees | ee ee eee 


be a short exact sequence. The following conditions are equivalent, and define a split 
exact sequence. 


(i) The sequence splits on the right. 
(ii) The sequence splits on the left. 


(iii) There is an isomorphism u of M and A@® B such that the following diagram is 
commutative. 


Thus M is isomorphic to the direct sum of A and B, and in addition, f can be 
identified with the injection i of A into A © B, and g with the projection 7 of the 
direct sum onto B. (The double vertical bars indicate the identity map.) 
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Proof. It follows from our earlier discussion of diagram (3) that (iii) implies (i) and (ii). 
To show that (i) implies (iii), let h be a homomorphism of B into M such that gh = 1. 
We claim that 


M =kerg @h(B). 


First, suppose that m € M. Write m = (m— hgm) + hgm; then hgm € hA(B) and 
g(m—hgm) = gm—ghgm = gm—1gm = gm—gm = 0. Second, suppose m € ker gNh(B), 
with m = hb. Then 0 gm ghb 1b b, som = hb = AO = 0, proving the 
claim. Now since kerg = im f by exactness, we may express any m € M in the form 
m= fa+hb. We take um = a+b, which makes sense because both f and h are injective 
and f(A)NhA(B) = 0. This forces the diagram of (iii) to be commutative, and wu is therefore 
an isomorphism by the short five lemma. Finally, we show that (ii) implies (iii). Let e be 
a homomorphism of M into A such that ef = 1. In this case, we claim that 


M = f(A) @ kere. 


If m € M then m = fem+(m-— fem) and fem € f(A), e(m— fem) = em—efem = 
em —em = 0. If m € f(A) NM kere, then, with m = fa, we have 0 = em = efa = a, 
so m = 0, and the claim is verified. Now ifm € M we have m = fa+m’ withae A 
and m’ € kere. We take u(m) = a+ g(m’) = a+ gm since gf = 0. (The definition of u 
is unambiguous because f is injective and f(A) M kere = 0.) The choice of u forces the 
diagram to be commutative, and again u is an isomorphism by the short five lemma. dé 


4.7.5 Corollary 


If the sequence 


0 he a as 0 


is split exact with splitting maps e and h as in (3), then the “backwards” sequence 


e 


0<— A<*_M<"—B<—0 
is also split exact, with splitting maps g and /f. 
Proof. Simply note that gkh=landef=1. & 

A device that I use to remember which way the splitting maps go (i.e., it’s ef = 1, 
not fe = 1) is that the map that is applied first points inward toward the “center” M. 
Problems For Section 4.7 


Consider the following commutative diagram with exact rows: 


0 >A >B >C 
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Our objective in Problems 1-3 is to find a homomorphism u: A — A’ such that the square 
ABB’ A’, hence the entire diagram, is commutative. 


1. Show that if u exists, it is unique. 
2. If a € A, show that vfa € im f’. 
3. If ufa = f'a’, define ua appropriately. 


Now consider another commutative diagram with exact rows: 


Ey | ee eee 


A 
A’ —> B’ Se! >() 
f g 


In Problems 4 and 5 we are to define wu: C — C’ so that the diagram will commute. 
4. If c € C, then since g is surjective, c = gb for some b € B. Write down the only 
possible definition of uc. 
5. In Problem 4, 6 is not unique. Show that your definition of wu does not depend on the 


particular b. 


Problems 6-11 refer to the diagram of the short five lemma (4.7.3). Application of the 
four lemma is very efficient, but a direct attack is also good practice. 
6. If t and v are injective, so is u. 
. If t and v are surjective, so is u. 


. If t is surjective and w is injective, then v is injective. 


Oo ON 


. If u is surjective, so is v. 
By Problems 8 and 9, if t and u are isomorphisms, so is v. 


10. If wu is injective, so is t. 


11. If u is surjective and v is injective, then ¢ is surjective. 
Note that by Problems 10 and 11, if uw and v are isomorphisms, so is t. 


12. If you have not done so earlier, do Problem 8 directly, without appealing to the four 
lemma. 


13. If you have not done so earlier, do Problem 11 directly, without appealing to the four 
lemma. 


Enrichment 


Chapters 1—4 form an idealized undergraduate course, written in the style of a graduate 
text. To help those seeing abstract algebra for the first time, I have prepared this section, 
which contains advice, explanations and additional examples for each section in the first 
four chapters. 


Section 1.1 


When we say that the rational numbers form a group under addition, we mean that 
rational numbers can be added and subtracted, and the result will inevitably be rational. 
Similarly for the integers, the real numbers, and the complex numbers. But the integers 
(even the nonzero integers) do not form a group under multiplication. If a is an integer 
other than +1, there is no integer b such that ab = 1. The nonzero rational numbers 
form a group under multiplication, as do the nonzero reals and the nonzero complex 
numbers. Not only can we add and subtract rationals, we can multiply and divide them 
(if the divisor is nonzero). The rational, reals and complex numbers are examples of fields, 
which will be studied systematically in Chapter 3. 

Here is what the generalized associative law is saying. To compute the product of the 
elements a,b,c,d and e, one way is to first compute bc, then (bc)d, then a((bc)d), and 
finally [a((bc)d)e]. Another way is (ab), then (cd), then (ab)(cd), and finally ([(ab)(cd)]e). 
All procedures give the same result, which can therefore be written as abcde. 

Notice that the solution to Problem 6 indicates how to construct a formal proof 
of 1.1.4. 


Section 1.2 


Groups whose descriptions differ may turn out to be isomorphic, and we already have an 
example from the groups discussed in this section. Consider the dihedral group De, with 
elements I, R, R?, F, RF, R?F. Let $3 be the group of all permutations of {1,2,3}. We 
claim that Dg and $3 are isomorphic. This can be seen geometrically if we view Dg as a 
group of permutations of the vertices of an equilateral triangle. Since Dg has 6 elements 
and there are exactly 6 permutations of 3 symbols, we must conclude that Dg and S3 
are essentially the same. To display an isomorphism explicitly, let R correspond to the 


1 


permutation (1,2,3) and F' to (2,3). Then 
I=(1), R=(1,2,3), R? = (1,3,2), F= (2,3), RF =(1,2), R°F = (1,3). 


If G is a nonabelian group, then it must have an element of order 3 or more. (For 
example, 53 has two elements of order 3.) In other words, if every element of G has order 
1 or 2, then G is abelian. To prove this, let a,b € G; we will show that ab = ba. We can 
assume with loss of generality that a #1 and b# 1. But then a? = 1 and b? = 1, so that 
a is its own inverse, and similarly for b. If ab has order 1, then ab = 1, so a and 6 are 
inverses of each other. By uniqueness of inverses, a = b, hence ab = ba. If ab has order 2, 
then abab = 1, so ab = b~'a~! = ba. 


Section 1.3 


Here is another view of cosets that may be helpful. Suppose that a coded message is to 
be transmitted, and the message is to be represented by a code word « (an n-dimensional 
vector with components in some field). The allowable code words are solutions of Ax = 0, 
where A is an m by n matrix, hence the set H of code words is an abelian group under 
componentwise addition, a subgroup of the abelian group G of all n-dimensional vectors. 
(In fact, G and H are vector space, but let’s ignore the additional structure.) Transmission 
is affected by noise, so that the received vector is of the form z = x+y, where y is another 
n-dimensional vector, called the error vector or error pattern vector. Upon receiving z, 
we calculate the syndrome s = Az. If s turns out to be the zero vector, we declare that no 
error has occurred, and the transmitted word is z. Of course our decision may be incorrect, 
but under suitable assumptions about the nature of the noise, our decision procedure will 
minimize the probability of making a mistake. Again, let’s ignore this difficulty and focus 
on the algebraic aspects of the problem. We make the following claim: 

Two vectors z; and zg have the same syndrome if and only if they lie in the same coset 
of H inG. 

To prove this, observe that Az; = Az iff A(z] — 22) = 0 iff 21) — zo € A iff 2 © 22+ H. 
(We are working in an abelian group, so we use the additive notation z2 + H rather than 
the multiplicative notation z2H.) 

Now suppose that we agree that we are going to correct the error pattern y;, in other 
words, if we receive z = 2, + y1, where x; is a code word, we will decode z as x1. If 
we receive 2’ = x, + y1, where x‘, is another code word, we decode z’ as x. Thus 
our procedure corrects yi regardless of the particular word transmitted. Here is a key 
algebraic observation: 

If y; and yz are distinct vectors that lie in the same coset, it is impossible to correct 
both YW and Y2.- 

This holds because y; = yo + x for some code word x ¥ 0, hence y, +0 = y24+ &. 
Therefore we cannot distinguish between the following two possibilities: 


1. The zero word is transmitted and the error pattern is y1; 


2. x is transmitted and the error pattern is yz. 


It follows that among all vectors in a given coset, equivalently among all vectors 
having the same syndrome, we can choose exactly one as a correctable error pattern. If 


the underlying field has only two elements 0 and 1, then (under suitable assumptions) it is 
best to choose to correct the pattern of minimum weight, that is, minimum number of 1’s. 
In particular, if the coset is the subgroup H itself, then we choose the zero vector. This 
agrees with our earlier proposal: if the received vector z has zero syndrome, we decode 
z as z itself, thus “correcting” the zero pattern, in other words, declaring that there has 
been no error in transmission. 

For further discussion and examples, see Information Theory by R. B. Ash, Dover 
1991, Chapter 4. 


Section 1.4 


Here are some intuitive ideas that may help in visualizing the various isomorphism theo- 
rems. In topology, we can turn the real interval [0, 1] into a circle by gluing the endpoints 
together, in other words identifying 0 and 1. Something similar is happening when we 
form the quotient group G/N where N is a normal subgroup of G. We have identified 
all the elements of N, and since the identity belongs to every subgroup, we can say that 
we have set everything in N equal to 1 (or 0 in the abelian case). Formally, (1.4.6) gives 
a correspondence between the subgroup of G/N consisting of the identity alone, and the 
subgroup N of G. 

We have already seen an example of this identification process. In (1.2.4), we started 
with the free group G generated by the symbols R and F, and identified all sequences 
satisfying the relations R” = I, F? = I, and RF = FR™ (equivalently RFRF = 1). 
Here we would like to take N to be the subgroup of G generated by R”, F?, and RFRF, 
but N might not be normal. We will get around this technical difficulty when we discuss 
generators and relations in more detail in Section 5.8. 


Section 1.5 


Direct products provide a good illustration of the use of the first isomorphism theorem. 
Suppose that G = H x K; what can we say about G/H? If (h,k) € G, then (h,k) = 
(h, 1x«)(1H,&), and intuitively we have identified (h, 1x) with the identity (ly, 1x). What 
we have left is (1q7,), and it appears that G/H should be isomorphic to kK. To prove 
this formally, define f: G— K by f(h,k) =k. Then f is an epimorphism whose kernel 
is {(h, lx): h € H}, which can be identified with H. By the first isomorphism theorem, 
G/H& K. 


Section 2.1 


Here is an interesting ring that will come up in Section 9.5 in connection with group 
representation theory. Let G = {21,...%m} be a finite group, and let R be any ring. 
The group ring RG consists of all elements rjx1 + +--+ + fmm. Addition of elements 
is componentwise, just as if the x; were basis vectors of a vector space and the r; were 
scalars in a field. Multiplication in RG is governed by the given multiplication in R, along 


with linearity. For example, 
2 2 
(rya1 + re%eq)($101 + $2%2) = 1181 2j +:17182%102 + 72812221 + 1282N5. 


The elements x7, 2122, %2%1, and x2 belong to G, which need not be abelian. The elements 
7181, 7182, T7281, and r2S2 belong to R, which is not necessarily commutative. Thus it is 
essential to keep track of the order in which elements are written. 


Section 2.2 


Here is some additional practice with ideals in matrix rings. If I is an ideal of M,,(R), 
we will show that J must have the form M,,(Jo) for some unique ideal Ip of R. [M;,(Jo) is 
the set of all n by n matrices with entries in Jo.| 

We note first that for any matrix A, we have Ej; AE, = ajrLu. This holds because 
E,;A puts row j of A in row i, and AE, puts column k of A in column J. Thus Fy; AE, 
puts aj, in the il position, with zeros elsewhere. 

If I is an ideal of M,,(R), let Ip be the set of all entries a1, where A = (a;;) is a matrix 
in I. To verify that Jp is an ideal, observe that (A+ B)i1 = a1. + 011, cay = (CE A)ui, 
and aytc = (AFiic)11. We will show that J = M,, (Io). 

If A € I, set i = 1 = 1 in the basic identity involving the elementary matrices Ej; 
(see the second paragraph above) to get aj,£1; € I. Thus aj, € Io for all j and k, so 
A€é M, (Ip). 

Conversely, let A € M,(Jo), so that aj € Jo for all i,l. By definition of Jp, there 
is a matrix B € I such that db); = ay. Take 7 = k = 1 in the basic identity to get 
Ey, BEY = bi Ey = ayy. Consequently, aij € I for alli and /. But the sum of the 
matrices a; fj, over all i and | is simply A, and we conclude that A € I. 

To prove uniqueness, suppose that M,,(Io) = M,(11). Ifa € Ip, then aE, € M,,(Io) = 
M,(11), so a € 1. A symmetrical argument completes the proof. 


Section 2.3 


If a and 6 are relatively prime integers, then a’ and b/ are relatively prime for all positive 
integers 7 and j. Here is an analogous result for ideals. Suppose that the ideals 1, and Ig 
of the ring R are relatively prime, so that I, + Iz = R. Let us prove that I? and Iz are 
relatively prime as well. By the definitions of the sum and product of ideals, we have 


R=RR=(+ L(+ h)=P+hbh+hLh+BcP+hcr 


so R = I? + Iz, as asserted. Similarly, we can show that R = I? + Iz by considering the 
product of I? + Iz and I; + Iz. More generally, an induction argument shows that if 


h+--+i,=R, 
then for all positive integers m1,...,™n we have 


TP te +L = R. 


Section 2.4 


We have defined prime ideals only when the ring R is commutative, and it is natural to ask 
why this restriction is imposed. Suppose that we drop the hypothesis of commutativity, 
and define prime ideals as in (2.4.4). We can then prove that if P is a prime ideal, J and J 
are arbitrary ideals, and P D IJ, then either P > I or P D J. [If the conclusion is false, 
there are elements a¢€ IJ\ Pandbe J\ P. Thenabe lJ CP, butag Pandb¢ Pia 
contradiction.| 

If we try to reverse the process and show that a proper ideal P such that P D IJ 
implies P > I or P D J must be prime, we run into trouble. If ab belongs to P, then the 
principal ideal (ab) is contained in P. We would like to conclude that (a)(b) C P, so that 
(a) C P or (b) C P, in other words, a € P or b € P. But (ab) need not equal (a)(b). For 
example, to express the product of the element ar € (a) and the element sb € (b) asa 
multiple of ab, we must invoke commutativity. 

An explicit example: Let P be the zero ideal in the ring M,,(R) of n by n matrices over 
a division ring R (see Section 2.2, exercises). Since M,(R) has no nontrivial two-sided 
ideals, P D> IJ implies P > I or P D J. But ab € P does not imply a € P or bE P, 
because the product of two nonzero matrices can be zero. 

This example illustrates another source of difficulty. The zero ideal P is maximal, 
but M,,(R)/P is not a division ring. Thus we cannot generalize (2.4.3) by dropping 
commutativity and replacing “field” by “division ring”. [If R/M is a division ring, it does 
follow that M is a maximal ideal; the proof given in (2.4.3) works.] 


Section 2.5 


Let’s have a brief look at polynomials in more than one variable; we will have much 
more to say in Chapter 8. For example, a polynomial f(X, Y, Z) in 3 variables is a sum of 
monomials; a monomial is of the form aX*Y/ Z" where a belongs to the underlying ring R. 
The degree of such a monomial is 1+7+h, and the degree of f is the maximum monomial 
degree. Formally, we can define R[X,Y] as (R[X])[Y], R[X, Y, Z] as (RX, Y])[Z], etc. 
Let f be a polynomial of degree n in F [X,Y], where F is a field. There are many cases 
in which f has infinitely many roots in F’. For example, consider f(X,Y) = X + Y over 
the reals. The problem is that there is no direct extension of the division algorithm (2.5.1) 
to polynomials in several variables. The study of solutions to polynomial equations in 
more than one variable leads to algebraic geometry, which will be introduced in Chapter 8. 


Section 2.6 


We have shown in (2.6.8) that every principal ideal domain is a unique factorization 
domain. Here is an example of a UFD that is not a PID. Let R = Z[X], which will be 
shown to be a UFD in (2.9.6). Let J be the maximal ideal < 2, X > (see (2.4.8)). If I is 
principal, then J consists of all multiples of a polynomial f(X) with integer coefficients. 
Since 2 € I, we must be able to multiply f(X) = a9 +a,X +---+a,X” by a polynomial 
g(X) = bo +b, X +-+-+byX'™ and produce 2. There is no way to do this unless f(X) = 1 
or 2. But if f(X) = 1 then I = R, a contradiction (a maximal ideal must be proper). 


Thus f(X) = 2, but we must also be able to multiply f(X) by some polynomial in Z[X] 
to produce X. This is impossible, and we conclude that J is not principal. 

A faster proof that Z[X] is not a PID is as follows. In (2.4.8) we showed that < 2 > 
and < X > are prime ideals that are not maximal, and the result follows from (2.6.9). 
On the other hand, the first method produced an explicit example of an ideal that is not 
principal. 


Section 2.7 


It may be useful to look at the Gaussian integers in more detail, and identify the primes 
in Z|]. To avoid confusion, we will call a prime in Z[i] a Gaussian prime and a prime 
in Z a rational prime. Anticipating some terminology from algebraic number theory, we 
define the norm of the Gaussian integer a + bi as N(a+ bi) = a? + b?. We will outline 
a sequence of results that determine exactly which Gaussian integers are prime. We use 
Greek letters for members of Z[i] and roman letters for ordinary integers. 


1. ais a unit in Z[i] iff N(aw) = 1. Thus the only Gaussian units are +1 and +i. 


(If a8 = 1, then 1 = N(1) = N(a)N(8), so both N(a) and N(G) must be 1.] 
Let n be a positive integer. 


2. If n is a Gaussian prime, then n is a rational prime not expressible as the sum of two 
squares. 


[n is a rational prime because any factorization in Z is also a factorization in Z[i]. If 
n=2?+y? =(x+iy)(x — iy), then either x + iy or  — iy is a unit. By (1),n=1,a 
contradiction.| 

Now assume that n is a rational prime but not a Gaussian prime. 


3. If a=a-+ bi is a nontrivial factor of n, then gcd(a, b) = 1. 


(If the greatest common divisor d is greater than 1, then d = n. Thus a divides n and n 
divides a, so n and a@ are associates, a contradiction. ] 


4. nis a sum of two squares. 


[Let n = (a+ bi)(c+ di); since n is real we have ad+ bc = 0, so a divides bc. By (3), a and 

b are relatively prime, so a divides c, say c = ka. Then b(ka) = bc = —ad, so d = —Dbk. 

Thus n = ac — bd = ka? + kb? = k(a? + 67). But a + bi is a nontrivial factor of n, so 

a? +b? = N(a+bi) > 1. Since n is a rational prime, we must have k = 1 and n = a? +0?.] 
By the above results, we have: 


5. If n is a positive integer, then n is a Gaussian prime if and only if n is a rational prime 
not expressible as the sum of two squares. 


Now assume that a = a+ bi is a Gaussian integer with both a and b nonzero. (The cases 
in which a or 6 is 0 are covered by (1) and (5).) 


6. If N(q) is a rational prime, then a is a Gaussian prime. 


[If a = Gy where @ and ¥ are not units, then N(a) = N(3)N(y), where N(@) and N(7) 
are greater than 1, contradicting the hypothesis.] 
Now assume that a is a Gaussian prime. 


7. If N(a) = hk is a nontrivial factorization, so that h > 1 and k > 1, then a divides 
either h or k. If, say, a divides h, then so does its complex conjugate @. 

[We have N(a) = a? + b? = (a+ bi)(a — bi) = aa = hk. Since a divides the product hk, 

it must divide one of the factors. If a3 = h, take complex conjugates to conclude that 

ab =h] 

8. N(q) is a rational prime. 


[If not, then we can assume by (7) that a and @ divide h. If a and @ are not associates, 
then N(a) = aq divides h, so hk divides h and therefore k = 1, a contradiction. If a and 
its conjugate are associates, then one is +7 times the other. The only way this can happen 
is if a = y(1 4%) where 7 is a unit. But then N(a) = N(y)N(14i) = N(L +7) =2,a 
rational prime.] 

By the above results, we have: 


9. Ifa =a+bi with a £0, b #0, then a is a Gaussian prime if and only if N(a) is a 
rational prime. 


The assertions (5) and (9) give a complete description of the Gaussian primes, except 
that it would be nice to know when a rational prime p can be expressed as the sum of 
two squares. We have 2 = 17+ 17, so 2 is not a Gaussian prime, in fact 2 = (1+7)(1—1). 
If p is an odd prime, then p is a sum of two squares iff p = 1 mod 4, as we will prove at 
the beginning of Chapter 7. Thus we may restate (5) as follows: 

10. If n is a positive integer, then n is a Gaussian prime iff n is a rational prime congruent 

to 3 mod 4. 

[Note that a number congruent to 0 or 2 mod 4 must be even.] 


Section 2.8 


Suppose that R is an integral domain with quotient field F’', and g is aring homomorphism 
from R to an integral domain R’. We can then regard g as mapping R into the quotient 
field F’ of R’. It is natural to try to extend g to a homomorphism 9g: F — F’. Ifa,be R 
with b # 0, then a = b(a/b), so we must have g(a) = g(b)g(a/b). Thus if an extension 
exists, it must be given by 


G(a/b) = g(a)[g(b)}~. 

For this to make sense, we must have g(b) # 0 whenever b 4 0, in other words, g is a 
monomorphism. [Note that if x,y € Rand g(x) = g(y), then g(a—y) = 0, hence x—y = 0, 
so x = y.] We will see in (3.1.2) that any homomorphism of fields is a monomorphism, so 
this condition is automatically satisfied. We can establish the existence of g by defining it 
as above and then showing that it is a well-defined ring homomorphism. This has already 
been done in Problem 8. We are in the general situation described in Problem 7, with S 
taken as the set of nonzero elements of R. We must check that g(s) is a unit in F” for 
every s € S, but this holds because g(s) is a nonzero element of F’. 


Section 2.9 


Here is another useful result relating factorization over an integral domain to factorization 
over the quotient field. Suppose that f is a monic polynomial with integer coefficients, 
and that f can be factored as gh, where g and A are monic polynomials with rational 
coefficients. Then g and h must have integer coefficients. More generally, let D be a 
unique factorization domain with quotient field F’, and let f be a monic polynomial in 
D[X]. If f = gh, with g, h © F[X], then g and h must belong to D[X]. 

To prove this, we invoke the basic proposition (2.9.2) to produce a nonzero \ € F 
such that Ag € D[X] and \~'h € D[X]. But g and h are monic, so \ = 1 and the result 
follows. 

Let f be a cubic polynomial in F(X]. If f is reducible, it must have a linear factor and 
hence a root in F’. We can check this easily if F' is a finite field; just try all possibilities. A 
finite check also suffices when F' = Q, by the rational root test (Section 2.9, Problem 1). 
If g is a linear factor of f, then f/g = h is quadratic. We can factor h as above, and in 
addition the quadratic formula is available if square roots can be extracted in F’. In other 
words, if a € F, then b? = a for some b € F. 


Section 3.1 


All results in this section are basic and should be studied carefully. You probably have 
some experience with polynomials over the rational numbers, so let’s do an example with 
a rather different flavor. Let F = Fy. be the field with two elements 0 and 1, and let 
f € F[X] be the polynomial X? + X + 1. Note that f is irreducible over F, because if f 
were factorable, it would have a linear factor and hence a root in F’. This is impossible, 
as f(0) = f(1) = 14 0. If we adjoin a root a of f to produce an extension F(a), we 
know that f is the minimal polynomial of a over F’, and that F(a) consists of all elements 
bo + b1.a, with bp and b; in F’. Since bp and b, take on values 0 and 1, we have constructed 
a field F(a) with 4 elements. Moreover, all nonzero elements of F(a) can be expressed 
as powers of a, as follows: 

a = 1,a'=a,a7 =-a-—1=1+a. (The last equality follows because 1 +1 =0 
in F.) 

This is a typical computation involving finite fields, which will be studied in detail in 
Chapter 6. 


Section 3.2 


We found in Problem 3 that a splitting field for X+— 2 has degree 8 over Q. If we make a 
seemingly small change and consider f(X) = X*—1, the results are quite different. The 
roots of f are 1, 7, —1 and —7. Thus Q(¢) is the desired splitting field, and it has degree 2 
over Q because the minimal polynomial of 7 over Q has degree 2. 

A general problem suggested by this example is to describe a splitting field for X” —1 
over Q for an arbitrary positive integer n. The splitting field is Q(w), where w is a 
primitive n*” root of unity, for example, w = e!2*/". We will see in Section 6.5 that the 
degree of Q(w) over Q is y(n), where y is the Euler phi function. 


Section 3.3 


In Problem 8 we used the existence of an algebraic closure of F' to show that any set of 
nonconstant polynomials in FLX] has a splitting field over F’. Conversely, if we suppose 
that it is possible to find a splitting field K for an arbitrary family of polynomials over 
the field F’, then the existence of an algebraic closure of F’ can be established quickly. 
Thus let K be a splitting field for the collection of all polynomials in FX], and let C 
be the algebraic closure of F' in K (see (3.3.4)). Then by definition, C is an algebraic 
extension of F' and every nonconstant polynomial in FLX] splits over C. By (3.3.6), C is 
an algebraic closure of F’. 


Section 3.4 


Let’s have another look at Example 3.4.8 with p = 2 to get some additional practice with 
separability and inseparability. We have seen that \/t is not separable over F, in fact it is 
purely inseparable because its minimal polynomial X? — ¢ can be written as (X — \/t)?. 
But if we adjoin a cube root of t, the resulting element \/t is separable over F', because 
X3— t has nonzero derivative, equivalently does not belong to F[X?] (see 3.4.3). 

Notice also that adjoining Vt and V/t is equivalent to adjoining ¥/t, in other words, 
F(t, Vt) = F(t). To see this, first observe that ifa = V/t, then /t = a® and Wt = a’. 
On the other hand, (Vt/‘/t)® = t. 

It is possible for an element a@ to be both separable and purely inseparable over F’, but 
it happens if and only if a belongs to F’. The minimal polynomial of a over F’' must have 
only one distinct root and no repeated roots, so min(a, F) = X — a. But the minimal 
polynomial has coefficients in F' (by definition), and the result follows. 


Section 3.5 


Suppose we wish to find the Galois group of the extension E/F’, where E = F(a). Assume 
that a@ is algebraic over F with minimal polynomial f, and that f has n distinct roots 
Q1 = Q,Q2,...,Q@, in some splitting field. If o € Gal(E/F), then o permutes the roots 
of f by (3.5.1). Given any two roots a; and a;, i € j, we can find an F-isomorphism 
that carries a; into a;; see (3.2.3). Do not jump to the conclusion that all permutations 
are allowable, and therefore Gal(E/F’) is isomorphic to S,,. For example, we may not be 
able to simultaneously carry a, into ag and az into ay. Another difficulty is that the 
F-isomorphism carrying a; into a; need not be an F-automorphism of E. This suggests 
that normality of the extension is a key property. If E/F is the non-normal extension of 
Example (3.5.10), the only allowable permutation is the identity. 


Section 4.1 


Finitely generated algebras over a commutative ring R frequently appear in applications 
to algebraic number theory and algebraic geometry. We say that A is a finitely generated 
R-algebra if there are finitely many elements 71,...,%» in A such that every element of 
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Aisa polynomial f(21,...,%n) with coefficients in R. Equivalently, A is a homomorphic 
image of the polynomial ring R[X1,...,X;]. The homomorphism is determined explicitly 
by mapping X; to a;, i = 1,...,n. The polynomial f(X1,...,X»n) is then mapped to 
f(x, sae Sdn): 


If every element is not just a polynomial in the x; but a linear combination of the x; 
with coefficients in R, then A is a finitely generated module over R. To see the difference 
clearly, look at the polynomial ring R[X], which is a finitely generated R algebra. (In the 
above discussion we can take n = 1 and 2; = X.) But if fi,..., f, are polynomials in 
R[X] and the maximum degree of the f; is m, there is no way to take linear combinations 
of the f; and produce a polynomial of degree greater than m. Thus R[X] is a not a finitely 
generated R-module. 


Section 4.2 


Here is some practice working with quotient modules. Let N be a submodule of the 
R-module M, and let 7 be the canonical map from M onto M/N, taking x € M to 
x+N é€M/N. Suppose that N; and No are submodules of M satisfying 


(a) M < No; 
(c) (M1) = 7(N2). 


Then Ni = No. 

To prove this, let « € Na. Hypothesis (c) says that (Nj + N)/N = (No + N)/N; we 
don’t write N;/N, i = 1,2, because N is not necessarily a submodule of N; or No. Thus 
c+N €(No+N)/N =(Ni+ N)/N,sox+N =y+N for some y € Nj. By (a), y € No, 
hence x —y € NaN N= N,N by (b). Therefore x — y and y both belong to Ni, and 
consequently so does «. We have shown that Nj < N1, and in view of hypothesis (a), we 
are finished. 


Section 4.3 


If M is a free R-module with basis S = (a;), then an arbitrary function f from S to 
an arbitrary R-module N has a unique extension to an R-homomorphism f: M — N; 
see (4.3.6). 

This property characterizes free modules, in other words, if M is an R-module with a 
subset S satisfying the above property, then M is free with basis S. To see this, build a 
free module M’ with basis S’ = (y;) having the same cardinality as S. For example, we 
can take M’ to be the direct sum of copies of R, as many copies as there are elements 
of S. Define f: S > S’ C M’ by f(x;) = y%, and let f be the unique extension of f to 
an R-homomorphism from M to M’. Similarly, define g: S’ > S C M by g(y;) = %, 
and let g be the unique extension of g to an R-homomorphism from M’ to M. Note 
that g exists and is unique because M’ is free. Now go f is the identity on S, so by 
uniqueness of extensions from S$ to M, Go f is the identity on M. Similarly, f 09 is the 
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identity on M’. Thus M and M’ are not only isomorphic, but the isomorphism we have 
constructed carries S into S’. It follows that M is free with basis S. 

This is an illustration of the characterization of an algebraic object by a universal 
mapping property. We will see other examples in Chapter 10. 


Section 4.4 


Here is some practice in decoding abstract presentations. An R-module can be defined as 
a representation of R in an endomorphism ring of an abelian group M. What does this 
mean? 

First of all, for each r € R, we have an endomorphism f,. of the abelian group M, given 
by f(z) = ra,x« € M. Tosay that f, is an endomorphism is to say that r(a+y) = ra+ry, 
ryeM,reR. 

Second, the mapping r — f, is a ring homomorphism from R to Endry(M). (Such 
a mapping is called a representation of R in Endr(M).) This says that f,1.(2) = 
fr(x) + fs(x), frs(x) = fr(fs(x)), and fi(x) = x. In other words, (r +s)" = rx + sa, 
(rs)x = r(sx), and la =z. 

Thus we have found a fancy way to write the module axioms. If you are already 
comfortable with the informal view of a module as a “vector space over a ring”, you are 
less likely to be thrown off stride by the abstraction. 


Section 4.5 


The technique given in Problems 1-3 for finding new bases and generators is worth em- 
phasizing. We start with a matrix A to be reduced to Smith normal form. The equations 
U = AX give the generators U of the submodule K in terms of the basis X of the free 
module M. The steps in the Smith calculation are of two types: 


1. Premultiplication by an elementary row matrix R. This corresponds to changing gen- 
erators via V = RU. 


2. Postmultiplication by an elementary column matrix C. This corresponds to changing 
bases via Y = C71 X. 


Suppose that the elementary row matrices appearing in the calculation are R,,..., Rs, 
in that order, and the elementary column matrices are C),...,C;, in that order. Then 
the matrices Q and P are given by 


Q=R,-:+RoRi, Po =CiCy---C; 


hence P = C;'---Cy'Cj'. The final basis for M is Y = PX, and the final generating 
set for K is V = QU = SY, where S = QAP™! is the Smith normal form (see 4.4.2). 


Section 4.6 


Here is a result that is used in algebraic number theory. Let G be a free abelian group 
of rank n, and H a subgroup of G. By the simultaneous basis theorem, there is a basis 
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Y1,---Yn Of G and there are positive integers a,,...a;, 7 <n, such that a; divides aj41 
for all i, and ayy1,...,@ryr is a basis for H. We claim that the abelian group G/H is 
finite if and only if r = n, and in this case, the size of G/H is |G/H| = aja2---a,. 

To see this, look at the proof of (4.6.3) with R” replaced by G and K by H. The 
argument shows that G/H is the direct sum of cyclic groups Z/Za;, i = 1,...,n, with 
a; = 0 for r <i <n. In other words, G/H is the direct sum of r finite cyclic groups (of 
order a1,...,@, respectively) and n — r copies of Z. The result follows. 

Now assume that r = n, and let 7,...,@, and 21,...,2n be arbitrary bases for G 
and H respectively. Then each z; is a linear combination of the x; with integer coefficients; 
in matrix form, z = Az. We claim that |G/H| is the absolute value of the determinant 
of A. To verify this, first look at the special case x; = y; and 2; = ajyj, i = 1,...,n. 
Then A is a diagonal matrix with entries a;, and the result follows. But the special case 
implies the general result, because any matrix corresponding to a change of basis of G 
or H is unimodular, in other words, has determinant +1. (See Section 4.4, Problem 1.) 


Section 4.7 


Here is some extra practice in diagram chasing. The diagram below is commutative with 
exact rows. 


A B C 0 

a ae 

A’ —> B' > C’ 0 
f g 


If t and wu are isomorphisms, we will show that v is also an isomorphism. (The hypothesis 
on t can be weakened to surjectivity.) 

Let c¢ € C’; then c’ = g'b’ for some b’ € B’. Since u is surjective, g'b’ = g'ub for some 
be B. By commutativity, g'ub = vgb, which proves that v is surjective. 

Now assume uc = 0. Since g is surjective, c= gb for some b € B. By commutativity, 
vgb = g'ub = 0. Thus ub € kerg’ = im f’, so ub = f’a’ for some a’ € A’. Since t is 
surjective, f’a’ = f’ta for some a € A. By commutativity, f’ta = ufa. We now have 
ub = ufa, so b— fa € keru, hence b = fa because u is injective. Consequently, 


c=gb=gfa=0 


which proves that v is injective. 


Chapter - 5 


>» Some Basic Techniques 
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Chapter 5 


Some Basic Techniques of 
Group Theory 


5.1 Groups Acting on Sets 


In this chapter we are going to analyze and classify groups, and, if possible, break down 
complicated groups into simpler components. To motivate the topic of this section, let’s 
look at the following result. 


5.1.1 Cayley’s Theorem 


Every group is isomorphic to a group of permutations. 


Proof. The idea is that each element g in the group G corresponds to a permutation of 
the set G itself. If « € G, then the permutation associated with g carries x into ga. If 
gx = gy, then premultiplying by g~! gives x = y. Furthermore, given any h € G, we can 
solve gz = h for x. Thus the map x — gz is indeed a permutation of G. The map from 
g to its associated permutation is injective, because if gx = ha for all x € G, then (take 
x =1)g=h. In fact the map is a homomorphism, since the permutation associated with 
hg is multiplication by hg, which is multiplication by g followed by multiplication by h, 
hog for short. Thus we have an embedding of G into the group of all permutations of 
the set G. & 


In Cayley’s theorem, a group acts on itself in the sense that each g yields a permutation 
of G. We can generalize to the notion of a group acting on an arbitrary set. 


5.1.2 Definitions and Comments 


The group G acts on the set X if for each g € G there is a mapping x — gx of X into 
itself, such that 
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(1) h(gx) = (hg)x for every g,h EG 
(2) la =a for every rE X. 


As in (5.1.1), « — ga defines a permutation of X. The main point is that the action 
of g is a permutation because it has an inverse, namely the action of g~*. (Explicitly, the 
inverse of r — gx is y ~ g-'y.) Again as in (5.1.1), the map from g to its associated 
permutation ®(g) is a homomorphism of G into the group Sx of permutations of X. But 
we do not necessarily have an embedding. If gx = hz for all x, then in (5.1.1) we were 
able to set x = 1, the identity element of G, but this resource is not available in general. 

We have just seen that a group action induces a homomorphism from G to Sx, and 
there is a converse assertion. If ® is a homomorphism of G to Sx, then there is a 
corresponding action, defined by ga = ®(g)a,x € X. Condition (1) holds because ® is a 
homomorphism, and (2) holds because ©(1) must be the identity of Sx. The kernel of ® 
is known as the kernel of the action; it is the set of all g € G such that gx = x for all a, 
in other words, the set of g’s that fix everything in X. 


5.1.38 Examples 


1. (The regular action) Every group acts on itself by multiplication on the left, as 
in (5.1.1). In this case, the homomorphism ® is injective, and we say that the action is 
faithful. 

(Similarly, we can define an action on the right by (ag)h = x(gh), «1 = x, and then G 
acts on itself by right multiplication. The problem is that ®(gh) = ®(h) o ®(g), an 
antihomomorphism. The damage can be repaired by writing function values as xf rather 
than f(x), or by defining the action of g to be multiplication on the right by g~t. We 
will avoid the difficulty by restricting to actions on the left.] 

2. (The trivial action) We take gx = x for all g € G, x € X. This action is highly 
unfaithful. 

3. (Conjugation on elements) We use the notation ge x for the action of g on x, and 
we set ge x = gxg—', called the conjugate of x by g, for g and x in the group G. Since 
hgxg th~+ = (hg)a(hg)~+ and 1z1~! = 2, we have a legal action of G on itself. The 
kernel is 
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{g: gxg° = for all x}, that is, {g: gx = xg for all x}. 


Thus the kernel is the set of elements that commute with everything in the group. This 
set is called the center of G, written Z(G). 

4. (Conjugation on subgroups) If H is a subgroup of G, we take ge H = gHg"!. 
Note that gHg~' is a subgroup of G, called the conjugate subgroup of H by g, since 
ghig-‘ghog' = g(hih2)g~' and (ghg~!)~! = gh~'g~t. As in Example (3), we have a 
legal action of G on the set of subgroups of G. 

5. (Conjugation on subsets) This is a variation of the previous example. In this case 
we let G act by conjugation on the collection of all subsets of G, not just subgroups. The 
verification that the action is legal is easier, because gHg~! is certainly a subset of G. 

6. (Multiplication on left cosets) Let G act on the set of left cosets of a fixed sub- 
group H by ge (aH) = (gx)H. By definition of set multiplication, we have a legitimate 
action. 
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7. (Multiplication on subsets) Let G act on all subsets of G by ge S = gS = {gr: x € 
S}. Again the action is legal by definition of set multiplication. 


Problems For Section 5.1 


1. Let G act on left cosets of H by multiplication, as in Example 6. Show that the kernel 
of the action is a subgroup of H. 


2. Suppose that H is a proper subgroup of G of index n, and that G is a simple group, that 
is, G has no normal subgroups except G itself and {1}. Show thatG can be embedded 
in Sy. 

3. Suppose that G is an infinite simple group. Show that for every proper subgroup 1 
of G, the index [G : H] is infinite. 

4. Let G act on left cosets of H by multiplication. Show that the kernel of the action is 


N= () cHax!. 
xEG 


5. Continuing Problem 4, if K is any normal subgroup of G contained in H, show that 
Kk < WN. Thus N is the largest normal subgroup of G contained in H; N is called the 
core of H inG. 


6. Here is some extra practice with left cosets of various subgroups. Let H and Kk be 
subgroups of G, and consider the map f which assigns to the coset g(H MK) the pair 
of cosets (gH, gK). Show that f is well-defined and injective, and therefore 


IG: HK) < |G: Al[G: kK}. 


Thus (Poincaré) the intersection of finitely many subgroups of finite index also has 
finite index. 


7. If |G: H] and [G: K] are finite and relatively prime, show that the inequality in the 
preceding problem is actually an equality. 


8. Let H be a subgroup of G of finite index n, and let G act on left cosets cH by 
multiplication. Let N be the kernel of the action, so that N < H by Problem 1. Show 
that [G : N] divides nl. 

9. Let H be a subgroup of G of finite index n > 1. If |G| does not divide n!, show that G 
is not simple. 
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5.2 The Orbit-Stabilizer Theorem 


5.2.1 Definitions and Comments 


Suppose that the group G acts on the set X. If we start with the element « € X and 
successively apply group elements in all possible ways, we get 


B(x) = {gz: g € G} 


which is called the orbit of x under the action of G. The action is transitive (we also say 
that G acts transitively on X) if there is only one orbit, in other words, for any x,y € X, 
there exists g € G such that gx = y. Note that the orbits partition X, because they are 
the equivalence classes of the equivalence relation given by y ~ « iff y = ga for some 
g EG. 

The stabilizer of an element x € X is 


G(a) = {9 €G: gu =a}, 


the set of elements that leave x fixed. A direct verification shows that G(x) is a subgroup. 
This is a useful observation because any set that appears as a stabilizer in a group action 
is guaranteed to be a subgroup; we need not bother to check each time. 

Before proceeding to the main theorem, let’s return to the examples considered in 
(5.1.3). 


5.2.2. Examples 


1. The regular action of G on G is transitive, and the stabilizer of x is the subgroup {1}. 

2. The trivial action is not transitive (except in trivial cases), in fact, B(a) = {x} for 
every x. The stabilizer of x is the entire group G. 

3. Conjugation on elements is not transitive (see Problem 1). The orbit of x is the set 
of conjugates gxg~' of x, that is, 


B(x) = {gxg*: 9 € G}, 
which is known as the conjugacy class of x. The stabilizer of x is 
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G(x) = {g: gxg_ = x} = {g: gx = xg}, 


the set of group elements that commute with x. This set is called the centralizer of x, 
written C¢(x). Similarly, the centralizer Cg¢(S) of an arbitrary subset S C G is defined 
as the set of elements of G that commute with everything in S. (Here, we do need to 
check that C@(S) is a subgroup, and this follows because Ce@(S) = (peg Ca(2).) 

4. Conjugation on subgroups is not transitive. The orbit of H is {gHg~!: g € G}, 
the collection of conjugate subgroups of H. The stabilizer of H is 


{g: gHg"' = H}, 


which is called the normalizer of H, written Nc(H). If K is asubgroup of G containing H, 
we have 


H<K iff gHg"' =H for every g€ K 
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and this holds iff K is a subgroup of Nc(H). Thus Ne(#) is the largest subgroup of G 
in which 4 is normal. 

5. Conjugation on subsets is not transitive, and the orbit of the subset S is {gSg 
g € G}. The stabilizer of S is the normalizerNg(S) = {g: gSg~* = S}. 

6. Multiplication on left cosets is transitive; a solution of g(aH) = yH for x is x = 
g ty. The stabilizer of rH is 


-1. 


{g: gtH = 2H} ={g: 2 ‘gt € H} ={g9:9¢€ 2He '}=2Hx", 


the conjugate of H by x. Taking 7 = 1, we see that the stabilizer of H is H itself. 
7. Multiplication on subsets is not transitive. The stabilizer of S is {g: gS = S}, the 
set of elements of G that permute the elements of S. 


5.2.3. The Orbit-Stabilizer Theorem 


Suppose that a group G acts on a set X. Let B(x) be the orbit of « € X, and let G(z) 
be the stabilizer of x. Then the size of the orbit is the index of the stabilizer, that is, 


|B(x)| =[G: G(@)]. 


Thus if G is finite, then |B(x)| = |G|/|G(~)]; in particular, the orbit size divides the order 
of the group. 


Proof. If y belongs to the orbit of x, say y = gx. We take f(y) = gH, where H = G(z) is 
the stabilizer of z. To check that f is a well-defined map of B(x) to the set of left cosets 
of A, let y = giz = gox. Then Jo ix = £, 8o 92°91 € A, ie., 9H = goH. Since g 
is an arbitrary element of G, f is surjective. If g1H = goH, then 991 € H, so that 
Jo ix = x, and consequently gia = goa. Thus if y: = gia, yo = goa, and f(y1) = f(y2), 
then y, = yo, proving f injective. & 


Referring to (5.2.2), Example 3, we see that B(x) is an orbit of size 1 iff z commutes 
with every g € G, ie., x € Z(G), the center of G. Thus if G is finite and we select one 
element x; from each conjugacy class of size greater than 1, we get the class equation 


|G] = |2(G)| + DIG: Ca(ai)]. 


We know that a group G acts on left cosets of a subgroup K by multiplication. To 
prepare for the next result, we look at the action of a subgroup H of G on left cosets of K. 
Since K is a left coset of K, it has an orbit given by {hk: h € H}. The union of the 
sets hK is the set product Hk. The stabilizer of K is not K itself, as in Example 6; it is 
{he H: hK = Kk}. But hK = K(= 14K) if and only if h € K, so the stabilizer is HN K. 


5.2.4 Proposition 


If H and K are subgroups of the finite group G, then 


| ||K| 


AK| = ———. 
cae ln aa 04 
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Proof. The cosets in the orbit of K are disjoint, and each has |K| members. Since, as 
remarked above, the union of the cosets is HK, there must be exactly |H K|/|K| cosets 
in the orbit. Since the index of the stabilizer of K is |H/H1 K|, the result follows from 
the orbit-stabilizer theorem. dé 


Problems For Section 5.2 


1. Let o be the permutation (1,2,3,4,5) and 7 the permutation (1, 2)(3,4). Then ro7~?, 
the conjugate of o by 7, can be obtained by applying 7 to the symbols of a to get 
(2,1,4,3,5). Reversing the process, if we are given rT = (1,2)(3,4) and we specify that 
utp * = (1,3)(2,5), we can take w = [132343]. This suggests that two permutations 
are conjugate if and only if they have the same cycle structure. Explain why this works. 


2. Show that if S is any subset of G, then the centralizer of S is a normal subgroup of 
the normalizer of S. (Let the normalizer Ng(S) act on S by conjugation on elements.) 


3. Let G(x) be the stabilizer of under a group action. Show that stabilizers of elements 
in the orbit of x are conjugate subgroups. Explicitly, for every g © G and x € X we 
have 


G(gx) = gG(a)g*. 


4. Let G act on the set X. Show that for a given  € X, U(gG(x)) = gz is a well-defined 
injective mapping of the set of left cosets of G(a) into X, and is bijective if the action 
is transitive. 


5. Continuing Problem 4, let G act transitively on X, and choose any x € X. Show that 
the action of G on X is essentially the same as the action of G on the left cosets of 
the stabilizer subgroup G(a). This is the meaning of the assertion that “any transitive 
G-set is isomorphic to a space of left cosets”. Give an appropriate formal statement 
expressing this idea. 


6. Suppose that G is a finite group, and for every x,y € G such that r A landy#l, x 
and y are conjugate. Show that the order of G must be 1 or 2. 


7. First note that if r is a positive rational number and & a fixed positive integer, there 
are only finitely many positive integer solutions of the equation 


Outline of proof: If x, is the smallest x;, the left side is at most k/x,, so 1 < ap < k/r 
and there are only finitely many choices for x,. Repeat this argument for the equation 
1 


feos gets cee 
T T — 


x4 Lk—-1 Cp” 

Now set r = 1 and let N(k) be an upper bound on all the 2;’s in all possible solutions. 
If G is a finite group with exactly k conjugacy classes, show that the order of G is at 
most N(k). 
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5.3 Application To Combinatorics 


The theory of group actions can be used to solve a class of combinatorial problems. To set 
up a typical problem, consider the regular hexagon of Figure 5.3.1, and recall the dihedral 
group Dj2, the group of symmetries of the hexagon (Section 1.2). 


ra | | % 
ae J 1 
5 ——— 6 

Figure 5.3.1 


If R is rotation by 60 degrees and F is reflection about the horizontal line joining 
vertices 1 and 4, the 12 members of the group may be listed as follows. 
Peidentity: A =(1,2/3,4,5,6),- R= (13,5) Ab), 


RS = (1,4)(2,5)(3,6), R4 = (1,5,3)(2,6,4), R° = (1,6,5,4,3,2) 
F = (2,6)(3,5), RF = (1,2)(3,6)(4,5), R°F = (1,3)(4,6) 
RSF = (1,4)(2,3)(5,6), RAF =(1,5)(2,4), ROF = (1,6)(2,5)(3, 4). 


(As before, RF means F followed by R.) 

Suppose that we color the vertices of the hexagon, and we have n colors available (we 
are not required to use every color). How many distinct colorings are there? Since we 
may choose the color of any vertex in n ways, a logical answer is n°. But this answer 
does not describe the physical situation accurately. To see what is happening, suppose 
we have two colors, yellow (Y) and blue (B). Then the coloring 


1 2 3 4 5 6 


CL PB. eo ye oe 


is mapped by RF to 


1 2 3 4 5 6 


C= pepBYY Y 


(For example, vertex 3 goes to where vertex 6 was previously, delivering the color yellow 
to vertex 6.) According to our counting scheme, C2 is not the same as C). But imagine 
that we have two rigid necklaces in the form of a hexagon, one colored by C and the 
other by C2. If both necklaces were lying on a table, it would be difficult to argue that 
they are essentially different, since one can be converted to a copy of the other simply by 
flipping it over and then rotating it. 

Let’s try to make an appropriate mathematical model. Any group of permutations of 
aset X acts on X in the natural way: ge a = g(x). In particular, the dihedral group G 
acts on the vertices of the hexagon, and therefore on the set S of colorings of the vertices. 
The above discussion suggests that colorings in the same orbit should be regarded as 
equivalent, so the number of essentially different colorings is the number of orbits. The 
following result will help us do the counting. 
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5.3.1 Orbit-Counting Theorem 


Let the finite group G act on the finite set X, and let f(g) be the number of elements of 
X fixed by g, that is, the size of the set {v € X: g(a) = x}. Then the number of orbits is 


al: 


gEG 


the average number of points left fixed by elements of G. 


Proof. We use a standard combinatorial technique called “counting two ways”. Let T be 
the set of all ordered pairs (g,x) such that g € G,x € X, and gx = x. For any « € X, 
the number of g’s such that (g, xz) € T is the size of the stabilizer subgroup G(x), hence 


IT| = S> IG). (1) 


cEX 


Now for any g € G, the number of 2’s such that (g,2) € T is f(g), the number of fixed 
points of g. Thus 


IT| = > f(g): (2) 


gEG 


Divide (1) and (2) by the order of G to get 


IG@)| _ 
Xa) Tai 2a (3) 


cEX 


But by the orbit-stabilizer theorem (5.2.3), |G|/|G(x)| is |B(x)|,the size of the orbit of x. 
If, for example, an orbit has 5 members, then 1/5 will appear 5 times in the sum on the 
left side of (3), for a total contribution of 1. Thus the left side of (3) is the total number 
of orbits. & 


We can now proceed to the next step in the analysis. 


5.3.2 Counting the Number of Colorings Fixed by a Given Per- 
mutation 


Let 7 = R? = (1,3,5)(2,4,6). Since 7(1) = 3 and (3) = 5, vertices 1,3 and 5 have the 
same color. Similarly, vertices 2,4 and 6 must have the same color. If there are n colors 
available, we can choose the color of each cycle in n ways, and the total number of choices 
is n?. If t = F = (2,6)(3,5), then as before we choose 1 color out of n for each cycle, but 
in this case we still have to color the vertices 1 and 4. Here is a general statement that 
covers both situations. 

If w has c cycles, counting cycles of length 1, then the number of colorings fixed by 7 


is n°. 
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To emphasize the need to consider cycles of length 1, we can write F’ as (2,6)(3,5)(1)(4). 
From the cycle decompositions given at the beginning of the section, we have one per- 
mutation (the identity) with 6 cycles, three with 4 cycles, four with 3 cycles, two with 2 
cycles, and two with 1 cycle. Thus the number of distinct colorings is 


1 
rial + 3n4 + 4n3 + 2n? + 2n). 


5.3.3 A Variant 


We now consider a slightly different question. How many distinct colorings of the vertices 
of a regular hexagon are there if we are forced to color exactly three vertices blue and 
three vertices yellow? The group G is the same as before, but the set S is different. Of 
the 64 possible colorings of the vertices, only (§) = 20 are legal, since 3 vertices out 
of 6 are chosen to be colored blue; the other vertices must be colored yellow. If 7 is a 
permutation of G, then within each cycle of a, all vertices have the same color, but in 
contrast to the previous example, we do not have a free choice of color for each cycle. To 
see this, consider R? = (1,3,5)(2,4,6). The cycle (1,3,5) can be colored blue and (2, 4,6) 
yellow, or vice versa, but it is not possible to color all six vertices blue, or to color all 
vertices yellow. Thus f(R?) = 2. If t = F = (2,6)(3,5)(1)(4), a fixed coloring is obtained 
by choosing one of the cycles of length 2 and one of the cycles of length 1 to be colored 
blue, thus producing 3 blue vertices. Consequently, f(f’) = 4. To obtain f(Z), note that 
all legal colorings are fixed by I, so f(Z) is the number of colorings of 6 vertices with 
exactly 3 blue and 3 yellow vertices, namely, ($) = 20. From the cycle decompositions 
of the members of G, there are two permutations with f = 2, three with f = 4, and one 
with f = 20; the others have f = 0. Thus the number of distinct colorings is 


1 
9 22) + 3(4) + 20) = 3. 


Problems For Section 5.3 


1. Assume that two colorings of the vertices of a square are equivalent if one can be 
mapped into the other by a permutation in the dihedral group G = Dg. If n colors 
are available, find the number of distinct colorings. 


2. In Problem 1, suppose that we color the sides of the square rather than the vertices. 
Do we get the same answer? 


3. In Problem 1, assume that only two colors are available, white and green. There 
are 16 unrestricted colorings, but only 6 equivalence classes. List the equivalence 
classes explicitly. 


4. Consider a rigid rod lying on the x-axis from x = —1 to x = 1, with three beads 
attached. The beads are located at the endpoints (—1,0) and (1,0), and at the center 
(0,0). The beads are to be painted using n colors, and two colorings are regarded 
as equivalent if one can be mapped into the other by a permutation in the group 
G = {I,c}, where o is the 180 degree rotation about the vertical axis. Find the 
number of distinct colorings. 


10 
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In Problem 4, find the number of distinct colorings if the color of the central bead is 
always black. 


. Consider the group of rotations of the regular tetrahedron (see Figure 5.3.2); G con- 


sists of the following permutations. 
(i) The identity; 


(ii) Rotations by 120 degrees, clockwise or counterclockwise, about an axis through 
a vertex and the opposite face. There are 8 such rotations (choose 1 of 4 vertices, 
then choose a clockwise or counterclockwise direction); 


(iii) Rotations by 180 degrees about the line joining the midpoints of two nontouching 
edges. There are 3 such rotations. 


Argue geometrically to show that there are no other rotations in the group, and show 
that G is isomorphic to the alternating group Ag. 


Figure 5.3.2 


. Given n colors, find the number of distinct colorings of the vertices of a regular 


tetrahedron, if colorings that can be rotated into each other are equivalent. 


. In Problem 7, assume that n = 4 and label the colors B,Y,W,G. Find the number of 


distinct colorings if exactly two vertices must be colored B. 


. The group G of rotations of a cube consists of the following permutations of the 


faces. 
(i) The identity. 


(ii) Rotations of +90 or 180 degrees about a line through the center of two opposite 
faces; there are 3 x 3 = 9 such rotations. 


Sa 


Rotations of +120 degrees about a diagonal of the cube, i.e., a line joining two 
opposite vertices (vertices that are a maximal distance apart). There are 4 
diagonals, so there are 4 x 2 = 8 such rotations. 


(iii 


WK 


Rotations of 180 degrees about a line joining the midpoints of two opposite edges. 
There are 6 such rotations. (An axis of rotation is determined by selecting one 
of the four edges on the bottom of the cube, or one of the two vertical edges on 
the front face.) 


(iv 


Argue geometrically to show that there are no other rotations in the group, and show 
that G is isomorphic to the symmetric group S4. 
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10. If six colors are available and each face of a cube is painted a different color, find the 
number of distinct colorings. 


11. Let G be the group of rotations of a regular p-gon, where p is an odd prime. If the 
vertices of the p-gon are to be painted using at most n colors, find the number of 
distinct colorings. 


12. Use the result of Problem 11 to give an unusual proof of Fermat’s little theorem. 


5.4 The Sylow Theorems 


Considerable information about the structure of a finite group G can be obtained by 
factoring the order of G. Suppose that |G| = p"m where p is prime, r is a positive integer, 
and p does not divide m. Then r is the highest power of p that divides the order of G. 
We will prove, among other things, that G must have a subgroup of order p”, and any 
two such subgroups must be conjugate. We will need the following result about binomial 
coefficients. 


5.4.1 Lemma 


nm 
r 


If n = p'm where p is prime, then (,") =m mod p. Thus if p does not divide m, then it 
does not divide @ es i; 


Proof. By the binomial expansion modulo p (see Section 3.4), which works for polynomials 
as well as for field elements, we have 


(X +1)?" = XP" 41?" = XP" +1 mod p. 
Raise both sides to the power m to obtain 
(XAT = OP -4 1) nod. p. 


On the left side, the coefficient of X?" is (,"), and on the right side, it is (,/"1) = m. 
The result follows. & 


5.4.2 Definitions and Comments 


Let p be a prime number. The group G is said to be a p-group if the order of each element 
of G is a power of p. (The particular power depends on the element.) If G is a finite 
group, then G is a p-group iff the order of G is a power of p. [The “if” part follows 
from Lagrange’s theorem, and the “only if” part is a corollary to the Sylow theorems; 
see (5.4.5).| 

If |G| = p"m, where p does not divide m, then a subgroup P of G of order p” is called 
a Sylow p-subgroup of G. Thus P is a p-subgroup of G of maximum possible size. 
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5.4.38 The Sylow Theorems 


Let G be a finite group of order p’m, where p is prime, r is a positive integer, and p does 
not divide m. Then 


(1) G has at least one Sylow p-subgroup, and every p-subgroup of G is contained in a 
Sylow p-subgroup. 

(2) Let n, be the number of Sylow p-subgroups of G. Then n, = 1 mod p and np di- 
vides m. 


(3) All Sylow p-subgroups are conjugate. Thus if we define an equivalence relation on 
subgroups by H ~ K iff H = gKg~' for some g € G, then the Sylow p-subgroups 
comprise a single equivalence class. [Note that the conjugate of a Sylow p-subgroup 
is also a Sylow p-subgroup, since it has the same number of elements p” .] 


Proof. (1) Let G act on subsets of G of size p” by left multiplication. The number of 


such subsets is ( a i: which is not divisible by p by (5.4.1). Consequently, since orbits 


partition the set acted on by the group, there is at least one subset S whose orbit size 
is not divisible by p. If P is the stabilizer of S, then by (5.2.3), the size of the orbit 
is [G : P|] = |G|/|P| = p’m/|P|. For this to fail to be divisible by p, we must have 
p'||P|, and therefore p™ < |P|. But for any fixed x € S, the map of P into S given by 
g — gu is injective. (The map is indeed into S because g belongs to the stabilizer of S, 
so that gS = S.) Thus |P| < |S| = p". We conclude that |P| = p”, hence P is a Sylow 
p-subgroup. 

So far, we have shown that a Sylow p-subgroup P exists, but not that every p-subgroup 
is contained in a Sylow p-subgroup. We will return to this in the course of proving (2) 
and (3). 

(2) and (3) Let X be the set of all Sylow p-subgroups of G. Then |X| = n, and P 
acts on X by conjugation, ic., ge Q = gQg-',g € P. By (5.2.3), the size of any orbit 
divides |P| = p”, hence is a power of p. Suppose that there is an orbit of size 1, that 
is, a Sylow p-subgroup Q € X such that gQg~! = Q, and therefore gQ = Qq, for every 
g € P. (There is at least one such subgroup, namely P.) Then PQ = QP, so by (1.3.6), 
PQ = (P,Q,), the subgroup generated by P and Q. Since |P| = |Q| = p” it follows from 
(5.2.4) that |PQ] is a power of p, say p°. We must have c < r because PQ is a subgroup 
of G (hence |PQ| divides |G|). Thus 


p = |P|)<|PQ| <p", so |P|=|PQ|= 2". 


But P is a subset of PQ, and since all sets are finite, we conclude that P = PQ, and 
therefore Q C P. Since both P and Q are of size p", we have P = Q. Thus there is only 
one orbit of size 1, namely {P}. Since by (5.2.3), all other orbit sizes are of the form p* 
where c > 1, it follows that n, = 1 mod p. 

Now let R be a psubgroup of G, and let R act by multiplication on Y, the set of 
left cosets of P. Since |Y| = [G : P] = |G|/|P| = p'm/p"” = m, p does not divide |Y]. 
Therefore some orbit size is not divisible by p. By (5.2.3), every orbit size divides |RI, 
hence is a power of p. (See (5.4.5) below. We are not going around in circles because 
(5.4.4) and (5.4.5) only depend on the existence of Sylow subgroups, which we have already 
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established.) Thus there must be an orbit of size 1, say {gP} with g 6 G. Ifhe Rthen 
hgP = gP, that is, g-thg € P, or equally well, h € gPg~'. Consequently, R is contained 
in a conjugate of P. If R is a Sylow p-subgroup to begin with, then R is a conjugate of 
P, completing the proof of (1) and (3). 

To finish (2), we must show that n, divides m. Let G act on subgroups by conjugation. 
The orbit of P has size n, by (3), so by (5.2.3), n, divides |G| = p’m. But p cannot be 
a prime factor of np, since n» = 1 mod p. It follows that n, must divide m. d& 


5.4.4 Corollary (Cauchy’s Theorem) 


If the prime p divides the order of G, then G has an element of order p. 


Proof. Let P be a Sylow p-subgroup of G, and pick x € P with « £ 1. The order of z is 
a power of p, say |z| = p*. Then x?” " has order p. @& 


5.4.5 Corollary 


The finite group G is a p-group if and only if the order of G is a power of p. 


Proof. If the order of G is not a power of p, then it is divisible by some other prime gq. But 
in this case, G has a Sylow qg-subgroup, and therefore by (5.4.4), an element of order gq. 
Thus G cannot be a p-group. The converse was done in (5.4.2). & 


Problems For Section 5.4 


1. Under the hypothesis of the Sylow theorems, show that G has a subgroup of index np. 


2. Let P be a Sylow p-subgroup of the finite group G, and let Q be any p-subgroup. If Q 
is contained in the normalizer Ng(P), show that PQ is a p-subgroup. 


3. Continuing Problem 2, show that Q is contained in P. 


4. Let P be a Sylow p-subgroup of the finite group G, and let H be a subgroup of G that 
contains the normalizer Ng(P). 


(a) If g € N@(HA), show that P and gPg~! are Sylow p-subgroups of H, hence they 
are conjugate in H. 


(b) Show that Ne(H) = H. 


5. Let P be a Sylow p-subgroup of the finite group G, and let N be a normal subgroup 
of G. Assume that p divides |N| and |G/N|, so that N and G/N have Sylow p- 
subgroups. Show that [PN : P| and p are relatively prime, and then show that PM N 
is a Sylow p-subgroup of N. 

6. Continuing Problem 5, show that PN/N is a Sylow p-subgroup of G/N. 

7. Suppose that P is the unique Sylow p-subgroup of G. (Equivalently, Pis a normal 


Sylow p-subgroup of G; see (5.5.4).] Show that for each automorphism f of G, we have 
f(P) = P. [Thus P is a characteristic subgroup of G; see (5.7.1).] 
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8. The Sylow theorems are about subgroups whose order is a power of a prime p. Here is 
a result about subgroups of index p. Let H be a subgroup of the finite group G, and 
assume that [G : H] = p. Let N be a normal subgroup of G such that N < H and 
[G : N] divides p! (see Section 5.1, Problem 8). Show that [H : N] divides (p — 1)!. 

9. Continuing Problem 8, let H be a subgroup of the finite group G, and assume that H 
has index p, where p is the smallest prime divisor of |G|. Show that H < G. 


5.5 Applications Of The Sylow Theorems 


The Sylow theorems are of considerable assistance in the problem of classifying, up to 
isomorphism, all finite groups of a given order n. But in this area, proofs tend to involve 
intricate combinatorial arguments, best left to specialized texts in group theory. We will 
try to illustrate some of the basic ideas while keeping the presentation clean and crisp. 


5.5.1 Definitions and Comments 


A group G is simple if G # {1} and the only normal subgroups of G are G itself and {1}. 
We will see later that simple groups can be regarded as building blocks for arbitrary finite 
groups. Abelian simple groups are already very familiar to us; they are the cyclic groups 
of prime order. For if « € G, « 4 1, then by simplicity (and the fact that all subgroups of 
an abelian group are normal), G = (x). If G is not of prime order, then G has a nontrivial 
proper subgroup by (1.1.4), so G cannot be simple. 

The following results will be useful. 


5.5.2 Lemma 


If H and K are normal subgroups of G and the intersection of H and K is trivial (i-e., {1}), 
then hk = kh for every h€ H andke k. 


Proof. We did this in connection with direct products; see the beginning of the proof 
of (1.5.2). & 


5.5.3 Proposition 


If P is a nontrivial finite p-group, then P has a nontrivial center. 


Proof. Let P act on itself by conjugation; see (5.1.3) and (5.2.2), Example 3. The orbits 
are the conjugacy classes of P. The element x belongs to an orbit of size 1 iff x is in 
the center Z(P), since grg~! = = for all g € P iff gx = xg for all g € P iff x € Z(P). 
By the orbit-stabilizer theorem, an orbit size that is greater than 1 must divide |P|, and 
therefore must be a positive power of p. If Z(P) = {1}, then we have one orbit of size 1, 
with all other orbit sizes = 0 mod p. Thus |P| = 1 mod p, contradicting the assumption 
that P is a nontrivial p-group. & 
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5.5.4 Lemma 


P is a normal Sylow p-subgroup of G if and only if P is the unique Sylow p-subgroup 
of G. 


Proof. By Sylow (3), the Sylow p-subgroups form a single equivalence class of conjugate 
subgroups. This equivalence class consists of a single element {P} iff gPg~! = P for 
every g€G, that is, iff PIG. &h 


5.5.5 Proposition 


Let G be a finite, nonabelian simple group. If the prime p divides the order of G, then 
the number n, of Sylow p-subgroups of G is greater than 1. 


Proof. If p is the only prime divisor of |G|, then G is a nontrivial p-group, hence Z(G) 
is nontrivial by (5.5.3). Since Z(G) <I G (see (5.1.3), Example 3), Z(G) = G, so that G 
is abelian, a contradiction. Thus |G] is divisible by at least two distinct primes, so if P 
is a Sylow p-subgroup, then {1} < P < G. Ifn, = 1, then there is a unique Sylow 
p-subgroup P, which is normal in G by (5.5.4). This contradicts the simplicity of G, so 
we must have np > 1. & 


We can now derive some properties of groups whose order is the product of two distinct 
primes. 


5.5.6 Proposition 


Let G be a group of order pq, where p and q are distinct primes. 


(i) If g#1 mod p, then G has a normal Sylow p-subgroup. 
(ii) G is not simple. 
(iii) If p #1 mod g and gq #1 mod p, then G is cyclic. 


Proof. (i) By Sylow (2), n» = 1 mod p and ny|q, so np = 1. The result follows from 
(5.5.4). 

(ii) We may assume without loss of generality that p > q. Then p cannot divide q—1, 
so g#1 mod p. By (i), G has a normal Sylow p-subgroup, so G is not simple. 

(iii) By (i), G has a normal Sylow p-subgroup P and a normal Sylow q-subgroup Q. 
Since P and Q are of prime order (p and gq, respectively), they are cyclic. If x generates P 
and y generates Q, then ry = yx by (5.5.2). [P and Q have trivial intersection because 
any member of the intersection has order dividing both p and q.] But then xy has order 
pq = |G| (see Section 1.1, Problem 8). Thus G= (zy). & 


We now look at the more complicated case |G] = p?q. The combinatorial argument in 
the next proof is very interesting. 
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5.5.7 Proposition 


Suppose that the order of the finite group G is pq, where p and q are distinct primes. 
Then G has either a normal Sylow p-subgroup or a normal Sylow q-subgroup. Thus G is 
not simple. 


Proof. If the conclusion is false then n, and n, are both greater than 1. By Sylow (2), ng 
divides p*, so ng = p or p”, and we will show that the second case leads to a contradiction. 
A Sylow q-subgroup Q is of order q and is therefore cyclic. Furthermore, every element 
of Q except the identity is a generator of Q. Conversely, any element of order q generates 
a Sylow q-subgroup. Since the only divisors of g are 1 and q, any two distinct Sylow 
g-subgroups have trivial intersection. Thus the number of elements of G of order q is 
exactly ng(q — 1). If ng = p*, then the number of elements that are not of order q is 


pq—p'(q—-1) =p. 


Now let P be any Sylow p-subgroup of G. Then |P| = p?, so no element of P can 
have order q (the orders must be 1, p or p?). Since there are only p? elements of order 
unequal to q available, P takes care of all of them. Thus there cannot be another Sylow p- 
subgroup, so n, = 1, a contradiction. We conclude that ng must be p. Now by Sylow (2), 
Nq = 1 mod q, hence p= 1 mod q, so p > q. But n, divides q, a prime, so np = q. Since 
Np = 1 mod p, we have q = 1 mod p, and consequently q > p. Our original assumption 
that both n, and ng are greater than one has led inexorably to a contradiction. de 


Problems For Section 5.5 


. Show that every group of order 15 is cyclic. 
. If G/Z(G) is cyclic, show that G = Z(G), and therefore G is abelian. 


. Show that for prime p, every group of order p? is abelian. 


Pe WwW NY 


. Let G be a group with |G| = pqr, where p, q and r are distinct primes and (without 
loss of generality) p > q > r. Show that |G| > 1+n,(p—1)+ng(q—-1) +n,(r —- 1). 


5. Continuing Problem 4, if G is simple, show that np, ng and n, are all greater than 1. 
Then show that np = qr, ng > p and n, > gq. 


6. Show that a group whose order is the product of three distinct primes is not simple. 


7. Let G be a simple group of order p"m, where r > 1, m > 1, and the prime p does 
not divide m. Let n = n, be the number of Sylow p-subgroups of G. If H = Ng(P), 
where P is a Sylow p-subgroup of G, then [|G : H] = n (see Problem 1 of Section 5.4). 
Show that P cannot be normal in G (hence n > 1), and conclude that |G| must 
divide n!. 


8. If G is a group of order 250,000 = 245°, show that G is not simple. 
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5.6 Composition Series 


5.6.1 Definitions and Comments 


One way to break down a group into simpler components is via a subnormal series 
1=Go iG, 4:--IG,=G. 


“Subnormal” means that each subgroup G; is normal in its successor Gj,;. In a normal 
series, the G; are required to be normal subgroups of the entire group G. For convenience, 
the trivial subgroup {1} will be written as 1. 

Suppose that G; is not a maximal normal subgroup of G;11, equivalently (by the 
correspondence theorem) Gj+1/G; is not simple. Then the original subnormal series can 
be refined by inserting a group H such that G;«H<G;,,. We can continue refining in the 
hope that the process will terminate (it always will if G is finite). If all factors Gi41/G; 
are simple, we say that the group G has a composition series. [By convention, the trivial 
group has a composition series, namely {1} itself.] 

The Jordan-Holder theorem asserts that if G has a composition series, the resulting 
composition length r and the composition factors Gi41/G; are unique (up to isomorphism 
and rearrangement). Thus all refinements lead to essentially the same result. Simple 
groups therefore give important information about arbitrary groups; if G,; and Gz have 
different composition factors, they cannot be isomorphic. 

Here is an example of a composition series. Let 54 be the group of all permutations 
of {1,2,3,4}, and A, the subgroup of even permutations (normal in $4 by Section 1.3, 
Problem 6). Let V be the four group (Section 1.2, Problem 6; normal in Ay, in fact in S4, 
by direct verification). Let Zy be any subgroup of V of order 2. Then 


14Z2<4V 4Ag<S4. 


The proof of the Jordan-Holder theorem requires some technical machinery. 


5.6.2 Lemma 
(i) If K d H < Gand f is a homomorphism on G, then f(A) < f(#). 
(ii) If K JH <Gand NG, then NK I NH. 
(iii) If A,B,C and D are subgroups of G with A < B and C <I D, then A(BNC) <i 
A(BN D), and by symmetry, C(DM A) < C(DN B). 
(iv) In (iii), AABNC)N BN D=C(DNA)N DNB. 
Equivalently, AABN C)N D=C(DNA)NB. 


Proof. (i) For h € H,k € K, we have f(h)f(k)f(h)~! = f(hkh-!) € f(K). 

(ii) Let f be the canonical map of G onto G/N. By (i) we have NK/N < NH/N. 
The result follows from the correspondence theorem. 

(iii) Apply (ii) with G= B,N = A,K =BNC,H=BnD. 

(iv) The two versions are equivalent because A(B MC) < Band C(DN A) < D. Ifa 
belongs to the set on the left, then « = ac for some a € A,c € BNC, and « also belongs 
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to D. But x = c(c~tac) = ca* for some a* € A <I B. Since x € Dandceé C < D, 
we have a* € D, hence a* € DN A. Thus x = ca* € C(DN A), and since x = ac, with 
a€A<Bandce BNC < B,x€C(DNA)NB. Therefore the left side is a subset of 
the right side, and a symmetrical argument completes the proof. d& 


The diagram below is helpful in visualizing the next result. 


To keep track of symmetry, take mirror images about the dotted line. Thus the group A 
will correspond to C, B to D, A(BNC) to C(DN A), and A(BN D) to C(DN B). 


5.6.3 Zassenhaus Lemma 


Let A,B,C and D be subgroups of G, with A < B and C < D. Then 


A(BND) _ C(DNB) 
A(BNC)  C(DN A)’ 


Proof. By part (iii) of (5.6.2), the quotient groups are well-defined. An element of the 
group on the left is of the form ayA(BNC),a € A,y € BND. But ay = y(y~tay) = ya*, 
a* € A. Thus ayA(BNC) = ya* A(BNC) = yA(BNC). Similarly, an element of the 
right side is of the form zC(DN A) with z€ DN B= BND. Thus if y,z € BND, then 


yA(BNC) = zA(BNC) iff z~'y € A(BNC)N BAD 
and by part (iv) of (5.6.2), this is equivalent to 
z'y€C(DNA)NDNB iff yC(DN A) = zC(DN A). 


Thus if h maps yA(BN C) to yC(D/N A), then h is a well-defined bijection from the left 
to the right side of Zassenhaus’ equation. By definition of multiplication in a quotient 
group, h is an isomorphism. d& 


5.6.4 Definitions and Comments 


If a subnormal series is refined by inserting H between G; and G41, let us allow H to 
coincide with G; or Gi41. If all such insertions are strictly between the “endgroups”, we 
will speak of a proper refinement. Two series are equivalent if they have the same length 
and their factor groups are the same, up to isomorphism and rearrangement. 


5.6.5 Schreier Refinement Theorem 


Let l= Aj dA, A--- dA, =Gandl= Ko <4 Ki <--- dK, =G be two subnormal 
series for the group G. Then the series have equivalent refinements. 
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Proof. Let Ai; = A; ( Ajai N K;), Ki; = Kj (Bj41 MN H;). By Zassenhaus we have 


Ajai Kits; 


Hi; Ki 


(In (5.6.3) take A = H;, B = Hi4i1, C = Kj, D = Kj41). We can now construct equivalent 
refinements; the easiest way to see this is to look at a typical concrete example. The first 
refinement will have r blocks of length s, and the second will have s blocks of length r. 
Thus the length will be rs in both cases. With r = 2 and s = 3, we have 


1 = Hoo 3 Hoi S Hoe 3 Ho3 = A, = Ayo S Ai < Ae < M3 = BH. =G, 
1 = Koo 3 Kio S Koo = Ky = Koi 2 Ay SI Kay = Ke = Koz 3 Kin J Kg = Kg =G. 


The corresponding factor groups are 


Ho1/Hoo0 = Ki0/Koo, Ho2/Ho = Kii/Ko1, Hos/Ho2 = Ki2/Ko2 
Hy /Hio = Ke0/Ki0, Hie/Hu = Koi /Ki1, His/Hi2 = Ke2/Ki2. 


(Notice the pattern; in each isomorphism, the first subscript in the numerator is increased 
by 1 and the second subscript is decreased by 1 in going from left to right. The subscripts 
in the denominator are unchanged.) The factor groups of the second series are a reordering 
of the factor groups of the first series. d& 


The hard work is now accomplished, and we have everything we need to prove the 
main result. 


5.6.6 Jordan-Holder Theorem 


If G has a composition series S (in particular if G is finite), then any subnormal se- 
ries R without repetition can be refined to a composition series. Furthermore, any two 
composition series for G are equivalent. 


Proof. By (5.6.5), R and S have equivalent refinements. Remove any repetitions from 
the refinements to produce equivalent refinements Ro and So without repetitions. But a 
composition series has no proper refinements, hence So = S, proving the first assertion. 
If R is also a composition series, then Rp = R as well, and R is equivalent to S. & 


Problems For Section 5.6 


1. Show that if G has a composition series, so does every normal subgroup of G. 
2. Give an example of a group that has no composition series. 


3. Give an example of two nonisomorphic groups with the same composition factors, up 
to rearrangement. 


Problems 4-9 will prove that the alternating group A, is simple for all n > 5. (A, 
and Ag are trivial and hence not simple; A, is not simple by the example given in (5.6.1); 
Ag is cyclic of order 3 and is therefore simple.) In these problems, N stands for a normal 
subgroup of An. 
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4. Show that if n > 3, then A,, is generated by 3-cycles. 
5. Show that if N contains a 3-cycle, then it contains all 3-cycles, so that N = An. 


6. From now on, assume that N is a proper normal subgroup of A,, and n > 5. Show 
that no permutation in N contains a cycle of length 4 or more. 


7. Show that no permutation in N contains the product of two disjoint 3-cycles. Thus 
in view of Problems 4,5 and 6, every member of N is the product of an even number 
of disjoint transpositions. 


8. In Problem 7, show that the number of transpositions in a nontrivial member of N 
must be at least 4. 


9. Finally, show that the assumption that N contains a product of 4 or more disjoint 
transpositions leads to a contradiction, proving that N = 1, so that A, is simple. It 
follows that a composition series for S;, is 1<4 And Sy. 


10. A chief series is a normal series without repetition that cannot be properly refined 
to another normal series. Show that if G has a chief series, then any normal series 
without repetition can be refined to a chief series. Furthermore, any two chief series 
of a given group are equivalent. 


11. In a composition series, the factor groups Gj41/G; are required to be simple. What 
is the analogous condition for a chief series? 


5.7 Solvable And Nilpotent Groups 


Solvable groups are so named because of their connection with solvability of polynomial 
equations, a subject to be explored in the next chapter. To get started, we need a property 
of subgroups that is stronger than normality. 


5.7.1 Definitions and Comments 


A subgroup H of the group G is characteristic (in G) if for each automorphism f of G, 
f(H) = H. Thus f restricted to H is an automorphism of H. Consequently, if H is 
characteristic in G, then it is normal in G. If follows from the definition that if H is 
characteristic in K and K is characteristic in G, then H is characteristic in G. Another 
useful result is the following. 


(1) If H is characteristic in AK and K is normal in G, then H is normal in G. 


To see this, observe that any inner automorphism of G maps K to itself, so restricts 
to an automorphism (not necessarily inner) of K. Further restriction to H results in an 
automorphism of H, and the result follows. 


5.7.2. More Definitions and Comments 


The commutator subgroup G’ of a group G is the subgroup generated by all commu- 
tators [x,y] = xyx~ty~+. (Since [z,y]~' = [y,2], G’ consists of all finite products of 


commutators.) Here are some basic properties. 
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(2) G’ is characteristic in G. 

This follows because any automorphism f maps a commutator to a commutator: 
flx, yl = (f(z), FY). 
(3) G is abelian if and only if G’ is trivial. 

This holds because [, y] = 1 iff ey = yz. 


(4) G/G’ is abelian. Thus forming the quotient of G by G’, sometimes called modding 
out by G’, in a sense “abelianizes” the group. 


For G'rG’y = G'yG'z iff G'zy = G'yz iff zy(yx)—! € G" iff zyx—!y—1 € G’, and this 
holds for all 2 and y by definition of G’. 


(5) If N dG, then G/N is abelian if and only if G’ < N. 
The proof of (4) with G’ replaced by N shows that G/N is abelian iff all commutators 


belong to N, that is, iff G’ < N. 
The process of taking commutators can be iterated: 


GO =G, GV =a’, GM =(4Y, 
and in general, 
Gut) — (GOO 4='0; 1,232: 


Since G+) is characteristic in G, an induction argument shows that each G is 
characteristic, hence normal, in G. 

The group G is said to be solvable if G'") = 1 for some r. We then have a normal 
series 


f= GMa GO) 4h. GO = GE 


called the derived series of G. 

Every abelian group is solvable, by (3). Note that a group that is both simple and 
solvable must be cyclic of prime order. For the normal subgroup G’ must be trivial; if it 
were G, then the derived series would never reach 1. By (3), G is abelian, and by (5.5.1), 
G must be cyclic of prime order. 

A nonabelian simple group G (such as A,,n > 5) cannot be solvable. For if G is 
nonabelian, then G’ is not trivial. Thus G’ = G, and as in the previous paragraph, the 
derived series will not reach 1. 

There are several equivalent ways to describe solvability. 


5.7.3 Proposition 


The following conditions are equivalent. 


(i) G is solvable. 


(ii) G has a normal series with abelian factors. 
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(iii) G has a subnormal series with abelian factors. 


Proof. Since (i) implies (ii) by (4) and (ii) implies (iii) by definition of normal and sub- 
normal series, the only problem is (iii) implies (i). Suppose G has a subnormal series 


1=G, 9G,-14---dGi dG@=G 


with abelian factors. Since G/G is abelian, we have G’ < G by (5), and an induction 
argument then shows that G < G; for all i. [The inductive step is G+) = (GM) < 
Gi < Gigi since G;/Gi41 is abelian.]| Thus GO <G.=1. & 


The next result gives some very useful properties of solvable groups. 


5.7.4 Proposition 


Subgroups and quotients of a solvable group are solvable. Conversely, if N is a normal 
subgroup of G and both N and G/N are solvable, then G is solvable. 


Proof. If H is a subgroup of the solvable group G, then H is solvable because H™ < G® 
for all 2. If N is a normal subgroup of the solvable group G, observe that commutators of 
G/N look like zyx~'y~!N, so (G/N)! = G'N/N. (Not G’/N, since N is not necessarily 
a subgroup of G’.) Inductively, 


(G/N) = GON/N 


and since N/N is trivial, G/N is solvable. Conversely, suppose that we have a subnormal 
series from No = 1 to N, = N, and a subnormal series from Go/N = 1 (ie., Go = N) 
to G,/N = G/N (ie., G; = G) with abelian factors in both cases. Then we splice the 
series of N;’s to the series of G;’s. The latter series is subnormal by the correspondence 
theorem, and the factors remain abelian by the third isomorphism theorem. @& 


5.7.5 Corollary 


If G has a composition series, in particular if G is finite, then G is solvable if and only if 
the composition factors of G are cyclic of prime order. 


Proof. Let Gi+1/G; be a composition factor of the solvable group G. By (5.7.4), Gi41 is 
solvable, and again by (5.7.4), Gi41/G; is solvable. But a composition factor must be a 
simple group, so G;11/G; is cyclic of prime order, as observed in (5.7.2). Conversely, if 
the composition factors of G are cyclic of prime order, then the composition series is a 
subnormal series with abelian factors. & 


Nilpotent groups arise from a different type of normal series. We will get at this idea 
indirectly, and give an abbreviated treatment. 
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5.7.6 Proposition 


If G is a finite group, the following conditions are equivalent, and define a nilpotent group. 
[Nilpotence of an arbitrary group will be defined in (5.7.8).] 


(a) G is the direct product of its Sylow subgroups. 
(b) Every Sylow subgroup of G is normal. 


Proof. (a) implies (b): By (1.5.3), the factors of a direct product are normal subgroups. 

(b) implies (a): By (5.5.4), there is a unique Sylow p,-subgroup H; for each prime 
divisor p; of |G|,i =1,...,k. By successive application of (5.2.4), we have |H,--- H,| = 
|H,|---|H;|, which is |G| by definition of Sylow p-subgroup. Since all sets are finite, 
G = H,.---H,. Furthermore, each H; Lj H; is trivial, because the orders of the Hj; 
are powers of distinct primes. By (1.5.4), G is the direct product of the H;. & 


5.7.7 Corollary 
Every finite abelian group and every finite p-group is nilpotent. 


Proof. A finite abelian group must satisfy condition (b) of (5.7.6). If P is a finite p- 
group, then P has only one Sylow subgroup, P itself, so the conditions of (5.7.6) are 
automatically satisfied. d& 


We now connect this discussion with normal series. Suppose that we are trying to 
build a normal series for the group G, starting with Go = 1. We take G) to be Z(G), the 
center of G; we have G' <I G by (5.1.3), Example 3. We define G2 by the correspondence 
theorem: 


G2/G, = Z(G/G1) 
and since Z(G/G1) I G/G, we have Gp <1 G. In general, we take 
G;/Gi-1 = Z(G/Gi-1), 


and by induction we have G; < G. The difficulty is that there is no guarantee that G; will 
ever reach G. However, we will succeed if G is a finite p-group. The key point is that a 
nontrivial finite p-group has a nontrivial center, by (5.5.3). Thus by induction, G;/Gi_1 
is nontrivial for every 7, so Gj_1 < Gj. Since G is finite, it must eventually be reached. 


5.7.8 Definitions and Comments 


A central series for G is a normal series 1 = Go < G, <I --- d G, = G such that 
G;/Gi-1 © Z(G/Gi_1) for every i = 1,...,r. (The series just discussed is a special 
case called the upper central series.) An arbitrary group G is said to be nilpotent if it 
has a central series. Thus a finite p-group is nilpotent, and in particular, every Sylow 
p-subgroup is nilpotent. Now a direct product of a finite number of nilpotent groups is 
nilpotent. (If Gj; is the i*” term of a central series of the j*” factor Hj, with Gi; = G 
if the series has already terminated at G, then ||; Gi; will be the i? term of a central 
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series for |], Hj.) Thus a finite group that satisfies the conditions of (5.7.6) has a central 
series. Conversely, it can be shown that a finite group that has a central series satisfies 
(5.7.6), so the two definitions of nilpotence agree for finite groups. 

Note that a nilpotent group is solvable. For if G;/G;_1 C Z(G/G;_1), then the 
elements of G;/G;_1 commute with each other since they commute with everything in 
G/G;-1; thus G;/G;_1 is abelian. Consequently, a finite p-group is solvable. 


Problems For Section 5.7 


. Give an example of a nonabelian solvable group. 

. Show that a solvable group that has a composition series must be finite. 

. Prove directly (without making use of nilpotence) that a finite p-group is solvable. 
. Give an example of a solvable group that is not nilpotent. 

. Show that if n > 5, then S, is not solvable. 

. If P is a finite simple p-group, show that P has order p. 


Nm ot FW NY 


. Let P be a nontrivial finite p-group. Show that P has a normal subgroup N whose 
index [P : N] is p. 

8. Let G be a finite group of order p"m, where r is a positive integer and p does not 

divide m. Show that for any k = 1,2,...,r, G has a subgroup of order p*. 


9. Give an example of a group G with a normal subgroup N such that N and G/N are 
abelian, but G is not abelian. (If “abelian” is replaced by “solvable”, no such example 
is possible, by (5.7.4).) 
10. If G is a solvable group, its derived length, dl(G), is the smallest nonnegative integer r 
such that G(”) = 1. If N is a normal subgroup of the solvable group G, what can be 
said about the relation between dl(G), dl(NV) and dl(G/N)? 


5.8 Generators And Relations 


In (1.2.4) we gave an informal description of the dihedral group via generators and rela- 
tions, and now we try to make the ideas more precise. 


5.8.1 Definitions and Comments 


The free group G on the set S (or the free group with basis S) consists of all words on S, 
that is, all finite sequences 41---2%,, n = 0,1,..., where each x; is either an element 
of S or the inverse of an element of S. We regard the case n = 0 as the empty word X. 
The group operation is concatenation, subject to the constraint that if s and s~! occur 
in succession, they can be cancelled. The empty word is the identity, and inverses are 
calculated in the only reasonable way, for example, (stu)~' = u~'t~!s~!. We say that G 
is free on S. 

Now suppose that G is free on S, and we attempt to construct a homomorphism f 
from G to an arbitrary group H. The key point is that f is completely determined by its 
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values on S'. If f(s1) =a, f(s2) =, f(s3) =c, then 


f(sisy's3) = f(s1)f(s2)~* f(s3) = ab te. 


Here is the formal statement, followed by an informal proof. 


5.8.2 Theorem 


If G is free on S and g is an arbitrary function from S to a group H, then there is a 
unique homomorphism f: G — H such that f = gon S. 


Proof. The above discussion is a nice illustration of a concrete example with all the 
features of the general case. The analysis shows both existence and uniqueness of f. A 
formal proof must show that all aspects of the general case are covered. For example, 
if u = 8187/83 and v = 8183's, '8483, then f(u) = f(v), so that cancellation of sz's4 
causes no difficulty. Specific calculations of this type are rather convincing, and we will 
not pursue the formal details. (See, for example, Rotman, An Introduction to the Theory 
of Groups, pp. 343-345.) & 


5.8.3 Corollary 


Any group H is a homomorphic image of a free group. 


Proof. Let S be a set of generators for H (if necessary, take S = H), and let G be free 
on S. Define g(s) = s for all s € S. If f is the unique extension of g to G, then since $ 
generates H, f isan epimorphism. & 


Returning to (1.2.4), we described a group H using generators R and F’, and relations 
R" =1, F? =I, RF = FR“. The last relation is equivalent to RF RF = I, since F? = I. 
The words R”, F? and RFRF are called relators, and the specification of generators and 
relations is called a presentation. We use the notation 


H =(R,F | R", F’, RFRF) 


or the long form 


Hah P| Roa SRE SPR), 


We must say precisely what it means to define a group by generators and relations, and 
show that the above presentation yields a group isomorphic to the dihedral group D2,. 
We start with the free group on {R, F'} and set all relators equal to the identity. It is 
natural to mod out by the subgroup generated by the relators, but there is a technical 
difficulty; this subgroup is not necessarily normal. 
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5.8.4 Definition 


Let G be free on the set S, and let K be a subset of G. We define the group (S' | K) as 
G/K, where K is the smallest normal subgroup of G containing K. 

Unfortunately, it is a theorem of mathematical logic that there is no algorithm which 
when given a presentation, will find the order of the group. In fact, there is no algorithm 
to determine whether a given word of (S| K) coincides with the identity. Logicians say 
that the word problem for groups is unsolvable. But although there is no general solution, 
there are specific cases that can be analyzed, and the following result is very helpful. 


5.8.5 Von Dyck’s Theorem 


Let H = (S'| K) be a presentation, and let L be a group that is generated by the words 
in S. If L satisfies all the relations of K, then there is an epimorphism a: H — L. 
Consequently, |H| > |Z]. 


Proof. Let G be free on S, and let 2 be the identity map from S', regarded as a subset of G, 
to S, regarded as a subset of L. By (5.8.2), 7 has a unique extension to a homomorphism 
f of G into L, and in fact f is an epimorphism because S generates L. Now f maps 
any word of G to the same word in L, and since LF satisfies all the relations, we have 
K Cker f. But the kernel of f is a normal subgroup of G, hence K C ker f. The factor 
theorem provides an epimorphism a: G/K > L. @& 


5.8.6 Justifying a presentation 

If L is a finite group generated by the words of $, then in practice, the crucial step in 
identifying L with H = (S | K) is a proof that |H| < |Z]. If we can accomplish this, 
then by (5.8.5), |H| = |Z]. In this case, a is a surjective map of finite sets of the same 
size, so q@ is injective as well, hence is an isomorphism. For the dihedral group we have 
H=(F,R| R", F?, RFRF) and L = Do», In (1.2.4) we showed that each word of H can 
be expressed as R'F with 0 < i < n—1and0 < j < 1. Therefore |H| < 2n = |Don| = |L}. 
Thus the presentation H is a legitimate description of the dihedral group. 


Problems For Section 5.8 


1. Show that a presentation of the cyclic group of order n is (a | a”). 


2. Show that the quaternion group (see (2.1.3, Example 4)) has a presentation (a,b | 
a =1,h =? ,ab= ba. 
3. Show that H = (a,b| a3 = 1,b? =1,ba = a~!b) is a presentation of $3. 


4. Is the presentation of a group unique? 


In Problems 5-11, we examine a different way of assembling a group from subgroups, 
which generalizes the notion of a direct product. Let N be a normal subgroup of G, 
and H an arbitrary subgroup. We say that G is the semidirect product of N by H if 
G = NH and NN A =1. (If H <i G, we have the direct product.) For notational 
convenience, the letter n, possibly with subscripts, will always indicate a member of N, 
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and similarly h will always belong to H. In Problems 5 and 6, we assume that G is the 
semidirect product of N by H. 


5. 
6. 


11. 
12. 


13. 


14. 


If nyhy = Noho, show that ny = Ng and hy = ho. 
If i: N — Gis inclusion and 7: G > H is projection (a(nh) = h), then the sequence 


1 T 


1 N G A 1 


is exact. Note that 7 is well-defined by Problem 5, and verify that a is a homomor- 


phism. Show that the sequence splits on the right, i.e., there is a homomorphism 
w: H + G such that row =1. 


. Conversely, suppose that the above exact sequence splits on the right. Since w is 


injective, we can regard H (and N as well) as subgroups of G, with w and 7 as 
inclusion maps. Show that G is the semidirect product of N by H. 


. Let N and H be arbitrary groups, and let f be a homomorphism of H into Aut N, 


the group of automorphisms of N. Define a multiplication on G = N x H by 
(m1, h1)(n2, he) = (ni f(hi) (m2), hiha). 


[f(h1)(m2) is the value of the automorphism f(h1) at the element nz.) A lengthy but 
straightforward calculation shows that G is a group with identity (1,1) and inverses 
given by (n,h)~! = (f(h~!)(n~'),h71). Show that G is the semidirect product of 
N x {1} by {1} x H. 


. Show that every semidirect product arises from the construction of Problem 8. 
10. 


Show by example that it is possible for a short exact sequence of groups to split on 
the right but not on the left. 

(If h: G — N isa left-splitting map in the exact sequence of Problem 6, then h and 7 
can be used to identify G with the direct product of N and H. Thus a left-splitting 
implies a right-splitting, but, unlike the result for modules in (4.7.4), not conversely.] 
Give an example of a short exact sequence of groups that does not split on the right. 


(The Frattini argument, frequently useful in a further study of group theory.) Let 
N be a normal subgroup of the finite group G, and let P be a Sylow p-subgroup of 
N. If Ne(P) is the normalizer of P in G, show that G = Ne(P)N (= NNe(P) by 
(1.4.3)).[If g € G, look at the relation between P and gPg™'.] 

Let N = {1,a,a?,...,a"~1} be a cyclic group of order n, and let H = {1,b} bea 
cyclic group of order 2. Define f: H — Aut N by taking f(b) to be the automorphism 
that sends a to a~'. Show that the dihedral group D2, is the semidirect product of N 
by H. (See Problems 8 and 9 for the construction of the semidirect product.) 


In Problem 13, replace N by an infinite cyclic group 
fiir Gr telah at 


Give a presentation of the semidirect product of N by H. This group is called the 
infinite dihedral group Doo. 
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Concluding Remarks 


Suppose that the finite group G has a composition series 
1=Go0<G,4--:-dG,=G. 


If H; = G;/G,_1, then we say that G; is an extension of G;_, by H; in the sense that 
G;-1 < G; and G;/G;_1 = H;. If we were able to solve the extension problem (find all 
possible extensions of G;_1 by H;) and we had a catalog of all finite simple groups, then 
we could build a catalog of all finite groups. This sharpens the statement made in (5.6.1) 
about the importance of simple groups. 


Chapter - 6 


>» Galois Theory 


Chapter 6 


Galois Theory 


6.1 Fixed Fields and Galois Groups 


Galois theory is based on a remarkable correspondence between subgroups of the Galois 
group of an extension E/F and intermediate fields between E and F. In this section 
we will set up the machinery for the fundamental theorem. [A remark on notation: 
Throughout the chapter, the composition 7 o o of two automorphisms will be written as 
a product To.] 


6.1.1 Definitions and Comments 


Let G = Gal(E/F) be the Galois group of the extension E/F’. If H is a subgroup of G, 
the fied field of H is the set of elements fixed by every automorphism in H, that is, 


F(H) = {a € E: o(a) =x for every o € H}. 
If K is an intermediate field, that is, F < K < E, define 
G(Kk) = Gal(E/K) = {0 € G: o(x) = 2 for every « € K}. 
I like the term “fixing group of K” for G(K), since G(K) is the group of automorphisms 
of EF that leave K fixed. Galois theory is about the relation between fixed fields and fixing 


groups. In particular, the next result suggests that the smallest subfield F’' corresponds 
to the largest subgroup G. 


6.1.2 Proposition 
Let E/F be a finite Galois extension with Galois group G = Gal(E/F). Then 


(i) The fixed field of G is F; 
(ii) If H is a proper subgroup of G, then the fixed field of H properly contains F’. 
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Proof. (i) Let Fo be the fixed field of G. If o is an F-automorphism of FE, then by 
definition of Fo, o fixes everything in Fy. Thus the F-automorphisms of G coincide with 
the Fo-automorphisms of G. Now by (3.4.7) and (3.5.8), E/Fo is Galois. By (3.5.9), the 
size of the Galois group of a finite Galois extension is the degree of the extension. Thus 
[E : F| =[B: Fo], so by (3.1.9), F = Fo. 

(ii) Suppose that F = F(H). By the theorem of the primitive element (3.5.12), we 
have EF = F(a) for some a € E. Define a polynomial f(X) € ELX] by 


f(X) = [] (X -o(e). 
o€H 
If 7 is any automorphism in H, then we may apply 7 to f (that is, to the coefficients of f; 
we discussed this idea in the proof of (3.5.2)). The result is 


(rf)(X) = [] (X - (r2)(a)). 
oCH 

But as o ranges over all of H, so does Ta, and consequently 7 f = f. Thus each coefficient 
of f is fixed by H, so f € F[X]. Now a is a root of f, since X — o(a@) is 0 when X =a 
and o is the identity. We can say two things about the degree of /: 

(1) By definition of f, deg f = |H| < |G| = |[E: F, and, since f is a multiple of the 
minimal polynomial of a over F, 

(2) deg f > [F(a): F] = [E: F], and we have a contradiction. d& 


There is a converse to the first part of (6.1.2). 


6.1.3. Proposition 


Let E/F be a finite extension with Galois group G. If the fixed field of G is F’, then E/F 
is Galois. 


Proof. Let G = {o1,...,0n}, where o1 is the identity. To show that E/F is normal, 
we consider an irreducible polynomial f € F[X] with a root a € E. Apply each au- 
tomorphism in G to a, and suppose that there are r distinct images a = a, = 0;(a), 
Q2 = 02(a),...,@, = 0,(a). If o is any member of G, then o will map each a; to some 
a,;, and since o is an injective map of the finite set {a1,...,a,} to itself, it is surjective as 
well. To put it simply, o permutes the a;. Now we examine what o does to the elementary 
symmetric functions of the a;, which are given by 


Tr 
ey = ) ai, €2 = ; Aaj, €3 = ; A4GAJAk, +--+, 
i=l 


1<Jj i<j<k 


Since o permutes the a;, it follows that o(e;) = e; for all 7. Thus the e; belong to the 
fixed field of G, which is F' by hypothesis. Now we form a monic polynomial whose roots 
are the a;: 


GX) = (X a) (XS) Sa He peek St CHIN'S. 
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Since the e; belong to F, g € F[X], and since the a; are in E, g splits over EF. We claim 
that g is the minimal polynomial of a over F’. To see this, let h(X) = bo +b, X +--+ -+b,X™ 
be any polynomial in FX] having a as a root. Applying o; to the equation 

bo + b}a+-+:b,a" = 0 
we have 

bo + bya; +--+ ba” = 0, 
so that each a; is a root of h, hence g divides h and therefore g =min(a, F’). But our 
original polynomial f € FLX] is irreducible and has a as a root, so it must be a constant 
multiple of g. Consequently, f splits over E, proving that E’/F is normal. Since the a,;, 


i=1,...r, are distinct, g has no repeated roots. Thus a is separable over F’', which shows 
that the extension E'/F is separable. & 


It is profitable to examine elementary symmetric functions in more detail. 


6.1.4 Theorem 


Let f be a symmetric polynomial in the n variables X,,...,X,. [This means that if o is 
any permutation in S,, and we replace X; by X,,;) fori =1,...,n, then f is unchanged.] 
If €1,...,@n are the elementary symmetric functions of the X;, then f can be expressed 


as a polynomial in the e;. 


Proof. We give an algorithm. The polynomial f is a linear combination of monomials 
of the form Xj;1---X/", and we order the monomials lexicographically: Xj1---X7" > 
X;'---X8 iff the first disagreement between r; and s; results in rj; > s;. Since f is 
symmetric, all terms generated by applying a permutation o € S, to the subscripts of 
Xj'-++-X7" will also contribute to f. The idea is to cancel the leading terms (those 
associated with the monomial that is first in the ordering) by subtracting an expression 


of the form 
eyes? +++ etn = (Xp +X) (Ky X,) 
which has leading term 
KE RAY Xo May AA Kg lh Se 
This will be possible if we choose 
t) =71 —12, to =T2—-73, ---,tn-1 =Tn-1 —Tn, tn =Tn- 
After subtraction, the resulting polynomial has a leading term that is below Xj'---X/" 


in the lexicographical ordering. We can then repeat the procedure, which must terminate 
in a finite number of steps. & 
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6.1.5 Corollary 


If g is a polynomial in FX] and f(a1,...,Q@n) is any symmetric polynomial in the roots 
Q1,-.-,Qn of g, then f € FX]. 


Proof. We may assume without loss of generality that g is monic. Then in a splitting 
field of g we have 


g(X) = (X = ay) (KX —ay) =X" ey XI He + (—1)" eg. 


By (6.1.4), f is a polynomial in the e;, and since the e; are simply + the coefficients of g, 
the coefficients of f arein F. & 


6.1.6 Dedekind’s Lemma 


The result that the size of the Galois group of a finite Galois extension is the degree of 
the extension can be proved via Dedekind’s lemma, which is of interest in its own right. 
Let G be a group and E a field. A character from G to EF is a homomorphism from G 
to the multiplicative group E* of nonzero elements of FE. In particular, an automorphism 
of E defines a character with G = E*, as does a monomorphism of F into a field L. 
Dedekind’s lemma states that if o1,...,@, are distinct characters from G to FE, then the 
o; are linearly independent over FE. The proof is given in Problems 3 and 4. 


Problems For Section 6.1 
1. Express X7X_X3 + X,X3X3 + X,X2X? in terms of elementary symmetric functions. 
2. Repeat Problem 1 forX?X» + X?2X3 + Xx, Xx? + ep. + X2X3 + X_X3 + 4X, XoX3. 


3. To begin the proof of Dedekind’s lemma, suppose that the o; are linearly dependent. 
By renumbering the o; if necessary, we have 


ajo, +---apo, = 0 


where all a; are nonzero and r is as small as possible. Show that for every h and g € G, 
we have 


ds aioi(h)oi(g) =0 (1) 


and 


[Equations (1) and (2) are not the same; in (1) we have o;(h), not o;(h).] 
4. Continuing Problem 3, subtract (2) from (1) to get 


Tr 


Y= ai(ou(h) — o1(h))ou(g) = 0. (3) 


i=1 


With g arbitrary, reach a contradiction by an appropriate choice of h. 
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5. If G is the Galois group of Q(W/2) over Q, what is the fixed field of G? 
6. Find the Galois group of C/R. 
7. Find the fixed field of the Galois group of Problem 6. 


6.2 The Fundamental Theorem 


With the preliminaries now taken care of, we can proceed directly to the main result. 


6.2.1 Fundamental Theorem of Galois Theory 


Let E/F be a finite Galois extension with Galois group G. If H is a subgroup of G, 
let F(H) be the fixed field of H, and if K is an intermediate field, let G(K) be Gal(E/K), 
the fixing group of K (see (6.1.1)). 


(1) F isa bijective map from subgroups to intermediate fields, with inverse G. Both maps 
are inclusion-reversing, that is, if H,; < Hz then F(H,) > F(Ho2), and if Ki < Ko, 
then G(k4) > G(K2). 


2) Suppose that the intermediate field K corresponds to the subgroup H under the 
& 
Galois correspondence. Then 
(a) E/K is always normal (hence Galois); 
(b) K/F is normal if and only if H is a normal subgroup of G, and in this case, 


(c) the Galois group of K/F is isomorphic to the quotient group G/H. Moreover, 
whether or not K/F is normal, 


(d) [Kk : F] =|G: H] and [E: Kk] =|H#|. 


(3) If the intermediate field K corresponds to the subgroup H and o is any automorphism 
in G, then the field oK = {o(x): x € K} corresponds to the conjugate subgroup 
oHo~". For this reason, cK is called a conjugate subfield of K. 


The following diagram may aid the understanding. 


E G 
| | 
K HH 
| | 
F il 


As we travel up the left side from smaller to larger fields, we move down the right side 
from larger to smaller groups. A statement about A/F’, an extension at the bottom of 
the left side, corresponds to a statement about G/H, located at the top of the right side. 
Similarly, a statement about E'/K corresponds to a statement about H/1 = H. 


Proof. (1) First, consider the composite mapping H — F(H) > GF(H). Ifo € H theno 
fixes F(H) by definition of fixed field, and therefore 0 € GF(H) = Gal(E/F(H)). Thus 
H C GF(#). If the inclusion is proper, then by (6.1.2) part (ii) with F' replaced by F(H), 
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we have F(H) > F(H), a contradiction. [Note that E/K is a Galois extension for any 
intermediate field AK’, by (3.4.7) and (3.5.8).] Thus GF(H) = H. 

Now consider the mapping K — G(K) — FG(K) = F Gal(E/K). By (6.1.2) part (i) 
with F replaced by K, we have FG(K) = K. Since both F and G are inclusion-reversing 
by definition, the proof of (1) is complete. 

(3) The fixed field of sHo~! is the set of all x € E such that oro 1(x) = x for every 
7 € H. Thus 


F(oHo') ={2 € E: 0-'(2) € F(H)} = o(F(H)). 


(2a) This was observed in the proof of (1). 

(2b) If o is an F-monomorphism of K into F, then by (3.5.2) and (3.5.6), 7 extends 
to an F-monomorphism of E£ into itself, in other words (see (3.5.6)), an F-automorphism 
of E&. Thus each such o is the restriction to K of a member of G. Conversely, the 
restriction of an automorphism in G to K is an F-monomorphism of K into E. By (3.5.5) 
and (3.5.6), K/F is normal iff for every o € G we have o(K) = K. But by (3), o(K) 
corresponds to 7Ho~! and K to H. Thus K/F is normal iff 7Ho~! = H for every o € G, 
ie, HAG. 

(2c) Consider the homomorphism of G = Gal(E/F) to Gal(K/F’) given by o > o|lx. 
The map is surjective by the argument just given in the proof of (2b). The kernel is the 
set of all automorphisms in G that restrict to the identity on K, that is, Gal(E/K) = H. 
The result follows from the first isomorphism theorem. 

(2d) By (3.1.9), [EB : F] =[E: K][K : F]. The term on the left is |G] by (3.5.9), and 
the first term on the right is | Gal(#/K)| by (2a), and this in turn is |H| since H = G(K). 
Thus |G| = |H|[K : F], and the result follows from Lagrange’s theorem. [If K/F' is 
normal, the proof is slightly faster. The first statement follows from (2c). To prove the 
second, note that by (3.1.9) and (3.5.9), 


[E: F] |G| 


Pa em Gay 


& 


The next result is reminiscent of the second isomorphism theorem, and is best visu- 
alized via the diamond diagram of Figure 6.2.1. In the diagram, EK is the composite of 
the two fields EF and K, that is, the smallest field containing both FE and K. 


6.2.2 Theorem 


Let E/F be a finite Galois extension and K/F an arbitrary extension. Assume that F 
and K are both contained in a common field, so that it is sensible to consider the com- 
posite Ek. Then 


(1) EK/K is a finite Galois extension; 


(2) Gal(EK/Kk) is embedded in Gal(£/F), where the embedding is accomplished by 
restricting automorphisms in Gal(E.K/K) to E; 


(3) The embedding is an isomorphism if and only if EN K = F. 


6.2. THE FUNDAMENTAL THEOREM il 


K 


/~, 


a 
F 
Figure 6.2.1 


Proof. (1) By the theorem of the primitive element (3.5.12), we have E = Fla] for some 
a € E,so EK = KFlal = K{a]. The extension K[a]/K is finite because a is algebraic 
over F’, hence over K. Since a, regarded as an element of E’K, is separable over F’ and 
hence over K, it follows that EK/K is separable. [To avoid breaking the main line of 
thought, this result will be developed in the exercises (see Problems 1 and 2).] 

Now let f be the minimal polynomial of a over F’, and g the minimal polynomial of a 
over Kk. Since f € K[X] and f(a) = 0, we have g| f, and the roots of g must belong to 
E C EK = K|a] because E'/F is normal. Therefore K[a] is a splitting field for g over K, 
so by (3.5.7), K[a]/K is normal. 

(2) If o is an automorphism in Gal(E.K/K), restrict o to E, thus defining a homomor- 
phism from Gal(EK/K) to Gal(£/F). (Note that o|g is an automorphism of E because 
E/F is normal.) Now o fixes K, and if o belongs to the kernel of the homomorphism, 
then o also fixes EF, so o fixes EK = Ka]. Thus a is the identity, and the kernel is trivial, 
proving that the homomorphism is actually an embedding. 

(3) The embedding of (2) maps Gal(#K/K) to a subgroup H of Gal(£/F), and we 
will find the fixed field of H. By (6.1.2), the fixed field of Gal(EK/K) is K, and since 
the embedding just restricts automorphisms to E, the fixed field of H must be EN K. 
By the fundamental theorem, H = Gal(E/(E£M K)). Thus 


H = Gal(E/F) iffGal(E/(E0 K)) = Gal(E/F), 


and by applying the fixed field operator F, we see that this happens if and only if EM 
K=F. & 


Problems For Section 6.2 


1. Let E = F(ay,...,Q@), where each a; is algebraic and separable over F’. We are going 
to show that F is separable over Ff’. Without loss of generality, we can assume that the 
characteristic of Fis a prime p, and since F’/F is separable, the result holds for n = 0. 
To carry out the inductive step, let E; = F(ai,...,a;), so that Biz, = F;(ai41). 
Show that Ej41 = E;(E?,,). (See Section 3.4, Problems 4-8, for the notation.) 


2. Continuing Problem 1, show that FE is separable over F’. 
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3. Let FE = F(a1,...,Qn), where each a; is algebraic over F’. If for each i = 1,...,n, all 
the conjugates of a; (the roots of the minimal polynomial of a; over F') belong to F, 
show that E/F is normal. 


4. Suppose that F = Ko < Ky, <---< K, = E, where E/F is a finite Galois extension, 
and that the intermediate field K; corresponds to the subgroup H; under the Galois 
correspondence. Show that K;/K;—1 is normal (hence Galois) if and only if H; < Aj_1, 
and in this case, Gal(K;/K;_-1) is isomorphic to H;_-1/H;. 

5. Let EF and K be extensions of F’, and assume that the composite E-K is defined. If A 
is any set of generators for K over F (for example, A = K), show that EK = E(4A), 
the field formed from EF by adjoining the elements of A. 


6. Let E/F be a finite Galois extension with Galois group G, and let E’/F" be a finite 
Galois extension with Galois group G’. If 7 is an isomorphism of E and E’ with 
T(F) = F’, we expect intuitively that G & G’. Prove this formally. 


7. Let K/F be a finite separable extension. Although K need not be a normal extension 
of F’, we can form the normal closure N of K over F, as in (3.5.11). Then N/F 
is a Galois extension (see Problem 8 of Section 6.3); let G be its Galois group. Let 
H = Gal(N/K), so that the fixed field of H is K. If H’ is a normal subgroup of G 
that is contained in H, show that the fixed field of H’ is N. 


8. Continuing Problem 7, show that H’ is trivial, and conclude that 


() gHg"' = {1} 


geG 


where 1 is the identity automorphism. 


6.3 Computing a Galois Group Directly 


6.3.1 Definitions and Comments 


Suppose that FE is a splitting field of the separable polynomial f over F. The Galois 
group of f is the Galois group of the extension E’/F’. (The extension is indeed Galois; 
see Problem 8.) Given f, how can we determine its Galois group? It is not so easy, but 
later we will develop a systematic approach for polynomials of degree 4 or less. Some 
cases can be handled directly, and in this section we look at a typical situation. A useful 
observation is that the Galois group G of a finite Galois extension E'/F acts transitively 
on the roots of any irreducible polynomial h € F[X] (assuming that one, hence every, 
root of h belongs to EF). [Each o € G permutes the roots by (3.5.1). If @ and 6 are roots 
of h, then by (3.2.3) there is an F-isomorphism of F(a) and F'(3) carrying a to 3. This 
isomorphism can be extended to an F-automorphism of E by (3.5.2), (3.5.5) and (3.5.6).] 


6.3.2 Example 


Let d be a positive integer that is not a perfect cube, and let @ be the positive cube root 
of d. Let w = e?7/3 = —4 + idV/3, so that w? = e?7/3 = —£ — it V3 = -(1 +). 
The minimal polynomial of 6 over the rationals Q is f(X) = X° — d, because if f were 
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reducible then it would have a linear factor and d would be a perfect cube. The minimal 
polynomial of w over Q is g(X) = X7+ X41. (If g were reducible, it would have a 
rational (hence real) root, so the discriminant would be nonnegative, a contradiction.) 
We will compute the Galois group G of the polynomial f(X)g(X), which is the Galois 
group of F = Q(0,w) over Q. 

If the degree of E'/Q is the product of the degrees of f and g, we will be able to 
make progress. We have [Q(6) : Q] = 3 and, since w, a complex number, does not belong 
to Q(@), we have [Q(0,w) : Q(@)| = 2. Thus [Q(@,w) : Q| = 6. But the degree of 
a finite Galois extension is the size of the Galois group by (3.5.9), so G has exactly 6 
automorphisms. Now any o € G must take @ to one of its conjugates, namely 0,w or 
w@. Moreover, o must take w to a conjugate, namely w or w?. Since o is determined by 
its action on @ and w, we have found all 6 members of G. The results can be displayed as 
follows. 

1:00, w > w, order = 1 

T:0— 0, w — w?, order = 2 

0:0 w6, w > w, order = 3 

oT: 0 w, w > w?, order = 2 

a7: 0 > w?0, w > w, order = 3 

Ta: 0 w6, w > w?, order = 2 


Note that to? gives nothing new since to? = oT. Similarly, 77 = To. Thus 
Car Shores Sor a (1) 


At this point we have determined the multiplication table of G, but much more insight 
is gained by observing that (1) gives a presentation of S3 (Section 5.8, Problem 3). We 
conclude that G = S3. The subgroups of G are 


{1}, G (a); (7), (re), (ro") 


and the corresponding fixed fields are 


E, Q QWw), Q), QwA), QWs). 


To show that the fixed field of (ro) = {1,70} is Q(w6), note that (7c) has index 3 in G, so 
by the fundamental theorem, the corresponding fixed field has degree 3 over Q. Now ta 
takes w6 to w?w? = w6 and [Q(wA) : Q| = 3 (because the minimal polynomial of w over 
Qis f). Thus Q(w6) is the entire fixed field. The other calculations are similar. 


Problems For Section 6.3 


1. Suppose that E = F(a) is a finite Galois extension of F’', where a is a root of the 
irreducible polynomial f ¢ F[X]. Assume that the roots of f are ay = a,2,...,Qn. 
Describe, as best you can from the given information, the Galois group of E/F. 

2. Let E/Q be a finite Galois extension, and let 7,...,% be a basis for E over Q. 
Describe how you would find a primitive element, that is, an a € E such that E = 
Q(q@). (Your procedure need not be efficient.) 
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3. Let G be the Galois group of a separable irreducible polynomial f of degree n. Show 
that G is isomorphic to a transitive subgroup H of S,,. [Transitivity means that if 7 
and j belong to {1,2,...,n}, then for some o € H we have o(t) = j. Equivalently, 
the natural action of H on {1,...,n}, given by he x = h(z), is transitive.] 

4. Use Problem 3 to determine the Galois group of an irreducible quadratic polynomial 
aX? +bX +c € F[X],a 4 0. Assume that the characteristic of F is not 2, so that 
the derivative of f is nonzero and f is separable. 


5. Determine the Galois group of (X? — 2)(X? — 3) over Q. 

6. In the Galois correspondence, suppose that K; is the fixed field of the subgroup H;, 
i = 1,2. Identify the group corresponding to K = ki Ko. 

7. Continuing Problem 6, identify the fixed field of Hy M Ho. 


8. Suppose that F is a splitting field of a separable polynomial f over F. Show that 
E/F is separable. [Since the extension is finite by (3.2.2) and normal by (3.5.7), E/F 
is Galois.] 

9. Let G be the Galois group of f(X) = X+— 2 over Q. Thus if 6 is the positive fourth 
root of 2, then G is the Galois group of Q(@,7)/Q. Describe all 8 automorphisms in G. 


10. Show that G is isomorphic to the dihedral group Dg. 

11. Define o(6) = i0, o(i) = i, 7(0) = O@, T(t) = —i, as in the solution to Problem 10. 
Find the fixed field of the normal subgroup N = {1, 07, 07,037} of G, and verify that 
the fixed field is a normal extension of Q. 


6.4 Finite Fields 


Finite fields can be classified precisely. We will show that a finite field must have p” 
elements, where p is a prime and n is a positive integer. In addition, there is (up to 
isomorphism) only one finite field with p” elements. We sometimes use the notation 
GF (p") for this field; GF stands for “Galois field”. Also, the field with p elements will 
be denoted by F, rather than Z,, to emphasize that we are working with fields. 


6.4.1 Proposition 


Let E be a finite field of characteristic p. Then |E| = p” for some positive integer n. 
Moreover, E is a splitting field for the separable polynomial f(X) = X?" — X over F,, So 
that any finite field with p” elements is isomorphic to E. Not only is E generated by the 
roots of f, but in fact E coincides with the set of roots of f. 


Proof. Since E contains a copy of F, (see (2.1.3), Example 2), we may view E as a vector 
space over F,. If the dimension of this vector space is n, then since each coefficient in a 
linear combination of basis vectors can be chosen in p ways, we have |E| = p”. 

Now let E* be the multiplicative group of nonzero elements of FE. If a € E*, then 
aP"-l — 1 by Lagrange’s theorem, so a?” = a for every a € E, including a = 0. Thus 
each element of E is a root of f, and f is separable by (3.4.5). Now f has at most p” 
distinct roots, and as we have already identified the p” elements of F as roots of f, in 
fact f has p” distinct roots and every root of f must belong to E. & 
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6.4.2 Corollary 


If E is a finite field of characteristic p, then E/F, is a Galois extension. The Galois group 
is cyclic and is generated by the Frobenius automorphism o(x) = z?, « € E. 


Proof. E is a splitting field for a separable polynomial over F,, so E'/F, is Galois; see 
(6.3.1). Since x? = x for each x € F,, F, is contained in the fixed field F((o)). But 
each element of the fixed field is a root of X? — X, so F((c)) has at most p elements. 
Consequently, F((c)) = F,. Now F, = F(Gal(E/F,)) by (6.1.2), so by the fundamental 
theorem, Gal(E/F,) = (co). & 


6.4.3 Corollary 


Let E/F be a finite extension of a finite field, with |E| = p”, |F| = p™. Then E/F isa 
Galois extension. Moreover, m divides n, and Gal(E/F’) is cyclic and is generated by the 
automorphism 7(#) = 2?” ,  € E. Furthermore, F is the only subfield of E of size p™. 


Proof. If the degree of E'/F is d, then as in (6.4.1), (p™)4 = p”, so d= n/m and m | n. 
We may then reproduce the proof of (6.4.2) with F,, replaced by F’, o by 7, «? by xP”, 
and X? by X?”. Uniqueness of F as a subfield of E with p™ elements follows because 
there is only one splitting field over F, for X?" —X inside E; see (3.2.1). & 


How do we know that finite fields (other than the F,) exist? There is no problem. 
Given any prime p and positive integer n, we can construct EF = GF(p"”) as a splitting 
field for XP" — X over F,,. We have just seen that if E contains a subfield F' of size p™, 
then m is a divisor of n. The converse is also true, as a consequence of the following basic 
result. 


6.4.4 Theorem 


The multiplicative group of a finite field is cyclic. More generally, if G is a finite subgroup 
of the multiplicative group of an arbitrary field, then G is cyclic. 


Proof. G is a finite abelian group, hence contains an element g whose order r is the 
exponent of G, that is, the least common multiple of the orders of all elements of G; see 
Section 1.1, Problem 9. Thus if x € G then the order of x divides r, so x" = 1. Therefore 
each element of G is a root of X" — 1, so |G| < r. But |G| is a multiple of the order of 
every element, so |G| is at least as big as the least common multiple, so |G| > r. We 
conclude that the order and the exponent are the same. But then g has order |G], so 
G = (g) and G is cyclic. & 


6.4.5 Proposition 


GF (p™) is a subfield of E = GF(p”) if and only if m is a divisor of n. 


Proof. The “only if” part follows from (6.4.3), so assume that m divides n. If ¢ is any 
positive integer greater than 1, then m | n iff (#” — 1) | (t* —1). (A formal proof is not 
difficult, but I prefer to do an ordinary long division of t’ — 1 by t’” — 1. The successive 
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quotients are t?~™, t?—2™,47—3™__ so the division will be successful iff n — rm = 0 for 
some positive integer r.) Taking t = p, we see that p™ — 1 divides |E*|, so by (6.4.4) 
and (1.1.4), E* has a subgroup H of order p™—1. By Lagrange’s theorem, each x € HU{0} 
satisfies x?” =a. As in the proof of (6.4.1), H U {0} coincides with the set of roots of 
XP" — X. Thus we may construct entirely inside GF(p”) a splitting field for X?” — X 
over F,. But this splitting field is a copy of GF(p™). dé 


In practice, finite fields are constructed by adjoining roots of carefully selected irre- 
ducible polynomials over F,. The following result is very helpful. 


6.4.6 Theorem 


Let p be a prime and n a positive integer. Then X?” — X is the product of all monic 
irreducible polynomials over F, whose degree divides n. 


Proof. Let us do all calculations inside E = GF (p") = the set of roots of f(X) = XP" —X. 
If g(X) is any monic irreducible factor of f(X), and deg g = m, then all roots of g lie 
in E. If a is any root of g, then F,,(q) is a finite field with p™ elements, so m divides n by 
(6.4.5) or (6.4.3). Conversely, let g(X) be a monic irreducible polynomial over F, whose 
degree m is a divisor of n. Then by (6.4.5), E& contains a subfield with p™ elements, 
and this subfield must be isomorphic to F,(a). If 6 € E corresponds to a under this 
isomorphism, then g(3) = 0 (because g(a) = 0) and f(3) = 0 (because 3 € E). Since g is 
the minimal polynomial of @ over F,,, it follows that g(X) divides f(X). By (6.4.1), the 
roots of f are distinct, so no irreducible factor can appear more than once. The theorem 
is proved. & 


6.4.7 The Explicit Construction of a Finite Field 


By (6.4.4), the multiplicative group E* of a finite field E = GF(p”) is cyclic, so E* can 
be generated by a single element a. Thus E = F,(a) = F,[al, so that a is a primitive 
element of #. The minimal polynomial of a over Fy is called a primitive polynomial. The 
key point is that the nonzero elements of E are not simply the nonzero polynomials of 
degree at most n— 1 in a, they are the powers of a. This is significant in applications to 
coding theory. Let’s do an example over F9. 

The polynomial g(X) = X*++ X +1 is irreducible over F2. One way to verify this is 
to factor X16 — X = X1° + X over Fo; the factors are the (necessarily monic) irreducible 
polynomials of degrees 1,2 and 4. To show that g is primitive, we compute powers of a: 

a =1, a! =a, a? =a”, a =a°, at =1+ 4 (since g(a) = 0), 

a =a+a7, 0 =o07 + 0%, a =o? +07=14+0+4+ 0%, oF =a+a? +0*%=1+ 07 
(since 1+1=0 in F9), 

a? = ata’, al? = 1+a+a?, al! = ata?+a3, al? = 1tata?+a3, al’ = 1+a?+a3, 
qi4 =1+ ae, 
and at this point we have all 24— 1 = 15 nonzero elements of GF(16). The pattern now 
repeats, beginning with a!° =a+a*=1. 


For an example of a non-primitive polynomial, see Problem 1. 
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Problems For Section 6.4 


Li 
2: 


10. 


Verify that the irreducible polynomial X4+X°+X?+X +1 € Fo[X] is not primitive. 


Let F be a finite field and d a positive integer. Show that there exists an irreducible 
polynomial of degree d in FX]. 


. In (6.4.5) we showed that m | n iff (¢" — 1) | (t? — 1) (€ = 2,3,...). Show that an 


equivalent condition is (X” — 1) divides (X” — 1). 

If EF is a finite extension of a finite field, or more generally a finite separable extension 
of a field F’, then by the theorem of the primitive element, E = F(a) for some a € E. 
We now develop a condition equivalent to the existence of a primitive element. 


. Let E/F bea finite extension, with E = F(a) and F < L < E. Suppose that the min- 


imal polynomial of a over L is g(X) = 71-9 b:X* +X", and let K = F(bo,...,b,—1). 
If h is the minimal polynomial of a over K, show that g = h, and conclude that 
L=K. 


. Continuing Problem 4, show that there are only finitely many intermediate fields L 


between F and F. 


. Conversely, let E = F(a1,...,@,) be a finite extension with only finitely many inter- 


mediate fields between EF and F’. We are going to show by induction that E/F has a 
primitive element. If n = 1 there is nothing to prove, so assume the result holds for 
all integers less than n. If L = F(ai,...,Q@n—1), show that FE = F(G,a,,) for some 
BEL. 


. Now assume (without loss of generality) that F is infinite. Show that there are distinct 


elements c,d € F such that F(c8 + an) = F(dB + an). 


. Continuing Problem 7, show that E = F(cG + a,). Thus a finite extension has a 


primitive element iff there are only finitely many intermediate fields. 


. Let a be an element of the finite field GF'(p"). Show that a and a? have the same 


minimal polynomial over Fp. 


Suppose that a is an element of order 13 in the multiplicative group of nonzero 
elements in GF'(3"). Partition the integers {0,1,...,12} into disjoint subsets such 
that if i and j belong to the same subset, then a* and a) have the same minimal 
polynomial. Repeat for a an element of order 15 in GF(2”). [Note that elements of 
the specified orders exist, because 13 divides 26 = 33 — 1 and 15 = 24-11] 


6.5 Cyclotomic Fields 


6.5.1 Definitions and Comments 


Cyclotomic extensions of a field F are formed by adjoining n‘” roots of unity. Formally, a 
cyclotomic extension of F is a splitting field E for f(X) = X"—1 over F’. The roots of f 
are called n*” roots of unity, and they form a multiplicative subgroup of the group E* of 
nonzero elements of E. This subgroup must be cyclic by (6.4.4). A primitive n*” root of 
unity is one whose order in E* is n. 
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It is tempting to say “obviously, primitive n*” roots of unity must exist, just take a 
generator of the cyclic subgroup”. But suppose that F’ has characteristic p and p divides n, 
say n = mp. If w is an n” root of unity, then 


0=w"—-1=(w”—1)P 


so the order of w must be less than n. To avoid this difficulty, we assume that the 
characteristic of F does not divide n. Then f’(X) =nX"~! ¥ 0, so the greatest common 
divisor of f and f’ is constant. By (3.4.2), f is separable, and consequently E’/F is Galois. 
Since there are n distinct n” roots of unity, there must be a primitive n“” root of unity w, 
and for any such w, we have E = F(w). 

If o is any automorphism in the Galois group Gal(E’/F’), then o must take a primitive 
root of unity w to another primitive root of unity w", where r and n are relatively prime. 
(See (1.1.5).) We can identify o with r, and this shows that Gal(E/F’) is isomorphic to a 
subgroup of U,,, the group of units mod n. Consequently, the Galois group is abelian. 

Finally, by the fundamental theorem (or (3.5.9)), [E : F'] = | Gal(£/F)|, which is a 
divisor of |U,| = y(n). 

Cyclotomic fields are of greatest interest when the underlying field F is Q, the rational 
numbers, and from now on we specialize to that case. The primitive n‘” roots of unity 
are e2""/" where r and n are relatively prime. Thus there are y(n) primitive n‘” roots 
of unity. Finding the minimal polynomial of a primitive n“” root of unity requires some 
rather formidable equipment. 


6.5.2 Definition 


The n‘” cyclotomic polynomial is defined by 


U(X) = [[(X - 4) 


a 


where the w; are the primitive n” roots of unity in the field C of complex numbers. Thus 
the degree of U,,(X) is y(n). 

From the definition, we have ¥|(X) = X — 1 and W2(X) = X +1. In general, the 
cyclotomic polynomials can be calculated by the following recursion formula, in which d 
runs through all positive divisors of n. 


6.5.3. Proposition 


X”—1=]] Wa(X). 
d|n 


In particular, if p is prime, then 


XP —1 
= => 


aa XPV 4 XP K+, 


V,(X 
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Proof. If w is an n*” root of unity, then its order in C* is a divisor d of n, and in this 


case, w is a primitive d‘” root of unity, hence a root of V(X). Conversely, if d | n, then 
any root of Vq(X) is a d‘”, hence an n*”, root of unity. & 


From (6.5.3) we have 
W3(X) = X74 X41, 
W(X) = X?7 +1, 5(X) = X44 X84 X?74X41, 
3 
V6(X) = (x— a ~ (xs aaa a Ht = bee Gea 
It is a natural conjecture that all coefficients of the cyclotomic polynomials are integers, 
and this turns out to be correct. 


6.5.4 Proposition 
W,(X) € ZX]. 
Proof. By (6.5.3), we have 


[TL wa(X)¥n(X). 


d|n,d<n 


By definition, the cyclotomic polynomials are monic, and by induction hypothesis, the 
expression in brackets is a monic polynomial in Z[X]. Thus W,,(X) is the quotient of two 
monic polynomials with integer coefficients. At this point, all we know for sure is that 
the coefficients of U,,(X) are complex numbers. But if we apply ordinary long division, 
even in C, we know that the process will terminate, and this forces the quotient V,,(X) 
to be in Z[X]. & 


We now show that the n*” cyclotomic polynomial is the minimal polynomial of each 
primitive n“” root of unity. 


6.5.5 Theorem 
W,,(X) is irreducible over Q. 


Proof. Let w be a primitive n*” root of unity, with minimal polynomial f over Q. Since 
w is a root of X" — 1, we have X” — 1 = f(X)g(X) for some g € Q[X]. Now it follows 
from (2.9.2) that if a monic polynomial over Z is the product of two monic polynomials f 
and g over Q, then in fact the coefficients of f and g are integers. 

If p is a prime that does not divide n, we will show that w? is a root of f. If not, 
then it is a root of g. But g(w”) = 0 implies that w is a root of g(X?), so f(X) divides 
g(X?), say g(X?) = f(X)h(X). As above, h € Z[X]. But by the binomial expansion 
modulo p, g(X)? = g(X”) = f(X)h(X) mod p. Reducing the coefficients of a polynomial 
k(X) mod p is equivalent to viewing it as an element k € F,[X], so we may write g(X)? = 
f(X)h(X). Then any irreducible factor of f must divide 7, so f and g have a common 
factor. But then X" — I has a multiple root, contradicting (3.4.2). [This is where we use 
the fact that p does not divide n.] 

Now we claim that every primitive n‘” root of unity is a root of f, so that deg f > 
y(n) =deg W,,, and therefore f = V, by minimality of f. The best way to visualize this 


h 
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is via a concrete example with all the features of the general case. If w is a primitive n‘” 


root of unity where n = 175, then w”? is a primitive n*” root of unity because 72 and 175 
are relatively prime. Moreover, since 72 = 2? x 37, we have 


w”? = (((((w)?)?)’)?) 


and the result follows. & 


6.5.6 Corollary 


The Galois group G of the n*” cyclotomic extension Q(w) /Q is isomorphic to the group Up, 
of units mod n. 


Proof. By the fundamental theorem, |G| = [Q(w) : Q| = deg V,, = y(n) = |U,,|. Thus the 
monomorphism of G and a subgroup of U,, (see (6.5.1)) is surjective. d& 


Problems For Section 6.5 
1. If pis prime and p divides n, show that U,,(X) = U,(X”). (This formula is sometimes 
useful in computing the cyclotomic polynomials.) 


2. Show that the group of automorphisms of a cyclic group of order n is isomorphic to 
the group U,, of units mod n. (This can be done directly, but it is easier to make use 
of the results of this section.) 


We now do a detailed analysis of subgroups and intermediate fields associated with the 
cyclotomic extension Q7 = Q(w)/Q where w = e!?™/7 is a primitive 7” root of unity. 
The Galois group G consists of automorphisms o;, i = 1,2,3,4,5,6, where o;(w) = w’. 


3. Show that o3 generates the cyclic group G. 


4. Show that the subgroups of G are (1) (order 1), (og) (order 2), (a2) (order 3), and 
G = (03) (order 6). 


5. The fixed field of (1) is Q7 and the fixed field of G is Q. Let K be the fixed field 
of (a6). Show that w+w! € K, and deduce that K = Q(w+w +) = Q(cos 27/7). 


6. Let L be the fixed field of (a2). Show that w + w? + w* belongs to L but not to Q. 
7. Show that L = Q(w + w? +.w%). 
8. If g=p", p prime, r > 0, show that 


U(X) =e +e tee $d 


where t= XP". 


9. Assuming that the first 6 cyclotomic polynomials are available [see after (6.5.3)], cal- 
culate Vjg(X) in an effortless manner. 
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6.6 The Galois Group of a Cubic 


Let f be a polynomial over F’, with distinct roots 71,...,2» in a splitting field E over F. 
The Galois group G of f permutes the x;, but which permutations belong to G? When f 
is a quadratic, the analysis is straightforward, and is considered in Section 6.3, Problem 4. 
In this section we look at cubics (and some other manageable cases), and the appendix 
to Chapter 6 deals with the quartic. 


6.6.1 Definitions and Comments 
Let f be a polynomial with roots 71,...,2, in a splitting field. Define 
A(f) = [[(« - «;). 
i<j 
The discriminant of f is defined by 
D(f) = A? = [[ (ei -—2;)?. 
i<j 


Let’s look at a quadratic polynomial f(X) = X? + bX +, with roots $(—b + Vb? — 4c). 
In order to divide by 2, we had better assume that the characteristic of F is not 2, and 
this assumption is usually made before defining the discriminant. In this case we have 
(x1—22)? = b?—4c, a familiar formula. Here are some basic properties of the discriminant. 


6.6.2 Proposition 
Let E be a splitting field of the separable polynomial f over F, so that E’/F is Galois. 


(a) D(f) belongs to the base field F’. 

(b) Let o be an automorphism in the Galois group G of f. Then o is an even permutation 
(of the roots of f) iff o(A) = A, and o is odd iff o(A) = —A. 

(c) GC Ay, that is, G consists entirely of even permutations, iff D(f) is the square of 
an element of F' (for short, D € F?). 


Proof. Let us examine the effect of a transposition o = (i,j) on A. Once again it is 
useful to consider a concrete example with all the features of the general case. Say 
n=15,i=7,j7 =10. Then 


3 — &7 > &3 — ©10, V3 — L10 7 V3 — L7 
T1090 — ©12 7 &7 — X12, L7 — X12 7 L109 — L12 
L7 — &g8 > L190 — Lg, L8 — L190 7 Lg — L7 
27 — X10 > L190 — V7. 
The point of the computation is that the net effect of (i,j) on A is to take x; — x; to 


its negative. Thus o(A) = —A when og is a transposition. Thus if o is any permutation, 
we have o(A) = A if A is even, and o(A) = —A if o is odd. Consequently, o(A?) = 
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(a(A))? = A?, so D belongs to the fixed field of G, which is F. This proves (a), and (b) 
follows because A #4 —A (remember that the characteristic of F' is not 2). Finally G C An 
iff o(A) =A foreveryo E€GifAEcCF(G)=F. & 


6.6.3. The Galois Group of a Cubic 


In the appendix to Chapter 6, it is shown that the discriminant of the abbreviated cubic 
X34 pX +q is —4p? — 27 q?, and the discriminant of the general cubic X? + aX? +bX +e 
is 


a*(b* — 4ac) — 4b? — 27c? + 18abe. 


Alternatively, the change of variable Y = X + § eliminates the quadratic term without 
changing the discriminant. 

We now assume that the cubic polynomial f is irreducible as well as separable. Then 
the Galois group G is isomorphic to a transitive subgroup of S3 (see Section 6.3, Prob- 
lem 3). By direct enumeration, G must be A3 or $3, and by (6.6.2(c)), G = As iff the 
discriminant D is a square in F’. 

If G = As, which is cyclic of order 3, there are no proper subgroups except {1}, so 
there are no intermediate fields strictly between E and F’. However, if G = S3, then the 
proper subgroups are 


sae (2, 3)}, {1, (1,3), th, (1,2)}, A3 = 15 (1,2, 3), (1,3, 2)}. 
If the roots of f are a,,a@2 and a3, then the corresponding fixed fields are 
F(ai), F(a2), F(a3), F(A) 


where As corresponds to F(A) because only even permutations fix A. 


6.6.4 Example 


Let f(X) = X° — 31X +62 over Q. An application of the rational root test (Section 2.9, 
Problem 1) shows that f is irreducible. The discriminant is —4(—31)?—27(62)? = 119164— 
103788 = 15376 = (124)?, which is a square in Q. Thus the Galois group of f is A3. 

We now develop a result that can be applied to certain cubics, but which has wider 
applicability as well. The preliminary steps are also of interest. 


6.6.5 Some Generating Sets of S,, 


,n). 
s4)sl 
(1,2), (2,3),...,(n—1,n). 


(i) S, is generated by the transpositions (1,2), (1,3),...,(1 
[An arbitrary transposition (7,7) can be written as (1, (1,7)0 

(ii) S;, is generated by transpositions of adjacent digits, i.e., 
[Since (1,7 —1)(j — 1,7), 7 — 1) = (1,7), we have 


(1, 2)(2,3)(1,2) = 1,3), (1,3)(3,4)(,3) = (1,4), ete., 


and the result follows from (i).] 
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(iii) S;, is generated by the two permutations a, = (1,2) and 7 = (1,2,...,n). 
[If c2 = T0171, then a is obtained by applying 7 to the symbols of a, (see Section 5.2, 
Problem 1). Thus o2 = (2,3). Similarly, 


03 = ToeT ' = (3,4),...,0n-1 = TOn_2T ' =(n—1,n), 


and the result follows from (ii).] 
(iv) S;, is generated by (1,2) and (2,3,...,n). 
[(1, 2)(2,3,...,) = (1,2,3,...,n), and (iii) applies.] 


6.6.6 Lemma 


If f is an irreducible separable polynomial over F' of degree n, and G is the Galois group 
of f, then n divides |G]. If n is a prime number p, then G contains a p-cycle. 


Proof. If a is any root of f, then [F(a) : F] = n, so by the fundamental theorem, G 
contains a subgroup whose index is n. By Lagrange’s theorem, n divides |G|. If n = p, 
then by Cauchy’s theorem, G contains an element o of order p. We can express ¢ as a 
product of disjoint cycles, and the length of each cycle must divide the order of a. Since 
p is prime, o must consist of disjoint p-cycles. But a single p-cycle already uses up all the 
symbols to be permuted, so a is a p-cycle. & 


6.6.7 Proposition 


If f is irreducible over Q and of prime degree p, and f has exactly two nonreal roots in 
the complex field C, then the Galois group G of f is Sp. 


Proof. By (6.6.6), G contains a p-cycle 7. Now one of the elements of G must be complex 
conjugation 7, which is an automorphism of C that fixes R (hence Q). Thus 7 permutes 
the two nonreal roots and leaves the p — 2 real roots fixed, so 7 is a transposition. Since 
p is prime, o* is a p-cycle for k = 1,...,p —1. It follows that by renumbering symbols if 
necessary, we can assume that (1,2) and (1,2,...,p) belong to G. By (6.6.5) part (iii), 
G=S),. & 


Problems For Section 6.6 


In Problems 1—4, all polynomials are over the rational field Q, and in each case, you are 
asked to find the Galois group G. 


1. f(X) = X3 — 2 (do it two ways) 

2. f(X)=X8-3X+1 

3.. f(X)= X° —10X4+2 

4, f(X) = X°4 3X? — 2X +1 (calculate the discriminant in two ways) 

5. If f is a separable cubic, not necessarily irreducible, then there are other possibilities 


for the Galois group G of f besides $3 and A3. What are they? 
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6. Let f be an irreducible cubic over Q with exactly one real root. Show that D(f) < 0, 
and conclude that the Galois group of f is S3. 


7. Let f be an irreducible cubic over Q with 3 distinct real roots. Show that D(f) > 0, 
so that the Galois group is A3 or $3 according as VD € Q or VD ¢~Q 


6.7 Cyclic and Kummer Extensions 


The problem of solving a polynomial equation by radicals is thousands of years old, but 
it can be given a modern flavor. We are looking for roots of f € FX], and we are only 
allowed to use algorithms that do ordinary arithmetic plus the extraction of n*” roots. 
The idea is to identify those polynomials whose roots can be found in this way. Now if 
a € F and our algorithm computes 9 = ¢/a in some extension field of F’, then 6 is a root 
of X” — a, so it is natural to study splitting fields of X” — a. 


6.7.1 Assumptions, Comments and a Definition 


Assume 

(i) E is a splitting field for f(X) = X” — a over F’, where a £ 0. 

(ii) F contains a primitive n“” root of unity w. 

These are natural assumption if we want to allow the computation of n*” roots. If 6 is 
any root of f in E, then the roots of f are 6,w0,...,w"~!0. (The roots must be distinct 
because a, hence 6, is nonzero.) Therefore F = F(@). Since f is separable, the extension 
E/F is Galois (see (6.3.1)). If G = Gal(E/F), then |G| = [E: F] by the fundamental 
theorem (or by (3.5.9)). 

In general, a cyclic extension is a Galois extension whose Galois group is cyclic. 


6.7.2. Theorem 


Under the assumptions of (6.7.1), E/F is a cyclic extension and the order of the Galois 
group G is a divisor of n. We have |G| = n if and only if f(X) is irreducible over F’. 


Proof. Let o € G; since o permutes the roots of f by (3.5.1), we have o(@) = w%(7@. 
[Note that o fixes w by (ii).] We identify integers u(a) with the same residue mod n. If 
o(0) = w"9, i = 1,2, then 


01 (02(8)) = wirn*uc)o, 
so 
u(o102) = u(o1) + u(o2) 
and u is a group homomorphism from G to Z,. If u(a) is 0 mod n, then o(0) = 6, so a is 
the identity and the homomorphism is injective. Thus G is isomorphic to a subgroup of 
Zn, 80 G is cyclic and |G| divides n. 
If f is irreducible over F’, then |G| = [E: F] = [F (0): F] = deg f =n. If f is not 


irreducible over F’, let g be a proper irreducible factor. If G is a root of g in E, then (@ is 
also a root of f, so BE = F(@) and |G| =[E: F]) =[F(G): F] =degg<n. & 
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Thus splitting fields of X” — a give rise to cyclic extensions. Conversely, we can prove 
that a cyclic extension comes from such a splitting field. 


6.7.3. Theorem 


Let E/F be a cyclic extension of degree n, where F contains a primitive n*” root of 
unity w. Then for some nonzero a € F, f(X) = X” — a is irreducible over F and E isa 
splitting field for f over F’. 


t 


Proof. Let o be a generator of the Galois group of the extension. By Dedekind’s lemma 
(6.1.6), the distinct automorphisms 1,o,07,...,0"~+ are linearly independent over E. 
Thus 1 + wo + w2a?7+--»+w"-!o! is not identically 0, so for some 3 € E we have 


9 = B+wo(B) +++» +w"*o"""(8) £0. 
Now 
o(0) = 0(8) +wo?(8) +-+-+u" 7a" (B) tw" 10"(B) =w 10 
since 0" (3) = 3. We take a = 0". To prove that a € F, note that 
o(8") = (0(8))" = (w*6)” = 0" 


and therefore o fixes 0”. Since o generates G, all other members of G fix 6”, hence a 
belongs to the fixed field of Gal(E/F), which is F’. 

Now by definition of a, @ is a root of f(X) = X”" — a, so the roots of X" — a 
are 0,w6,...,w" 10. Therefore F() is a splitting field for f over F. Since o(0) = w 18, 
the distinct automorphisms 1,¢,...,0"~! can be restricted to distinct automorphisms 
of F(@). Consequently, 


n <|Gal(F(0)/F)| = [F(9) : F] < deg f =n 


so [F(0) : F] = n. It follows that E = F(0) and (since f must be the minimal polynomial 
of 6 over F’) f is irreducible over F. & 
A finite abelian group is a direct product of cyclic groups (or direct sum, in additive 


notation; see (4.6.4)). It is reasonable to expect that our analysis of cyclic Galois groups 
will help us to understand abelian Galois groups. 


6.7.4 Definition 


A Kummer extension is a finite Galois extension with an abelian Galois group. 


6.7.5 Theorem 


Let E/F bea finite extension, and assume that F contains a primitive n” root of unity w. 
Then £'/F is a Kummer extension whose Galois group G has an exponent dividing n if 
and only if there are nonzero elements aj,...,a, € F such that F is a splitting field of 
(X” — a,)--+-(X" —a,) over F’. [For short, E = F(%/a,,..., 7a,.).] 
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Proof. We do the “if” part first. As in (6.7.1), we have EF = F(61,...,6,) where 6; is a 
root of X”" —a,;. If o € Gal(E/F), then o maps 6; to another root of X” — a;, so 


Thus if o and 7 are any two automorphisms in the Galois group G, then or = To and G 
is abelian. [The wu, are integers, so u;(o) + ui(T) = ui(T) + ui(o).] Now restrict attention 
to the extension F'(6;). By (6.7.2), the Galois group of F'(6;)/F has order dividing n, so 
o”(6;) = 6; for alli = 1,...,r. Thus o” is the identity, and the exponent of G is a divisor 
of n. 

For the “only if” part, observe that since G is a finite abelian group, it is a direct 
product of cyclic groups C),...,C,. For each i = 1,...,r, let H; be the product of the 
C; for 7 4 i; by (1.5.3), H; I G. We have G/H; = C; by the first isomorphism theorem. 
(Consider the projection mapping 71 --- x, — x; € C;.) Let K; be the fixed field of H;. By 
the fundamental theorem, K;/F is a Galois extension and its Galois group is isomorphic 
to G/H;, hence isomorphic to C;. Thus K;/F is a cyclic extension of degree d; = |C;|, 
and d; is a divisor of n. (Since G is the direct product of the C;, some element of G has 
order d;, so d; divides the exponent of G and therefore divides n.) We want to apply 
(6.7.3) with n replaced by d;, and this is possible because F' contains a primitive d‘" root 
of unity, namely w”/%. We conclude that K; = F(6;), where 0% is a nonzero element 
b € F. But 07 = 9/4) = or/" a, € F. 

Finally, in the Galois correspondence, the intersection of the H; is paired with the 
composite of the K;, which is F(1,...,6,-); see Section 6.3, Problem 7. But ();_, Hi = 1, 
so E = F(0,,...,0,), and the result follows. d& 


Problems For Section 6.7 


1. Find the Galois group of the extension Q(V2, V3, V5, V7) [the splitting field of (X?— 
2)(X? — 3)(X? — 5)(X? — 7)] over Q. 

2. Suppose that E is a splitting field for f(X) = X” —a over F', a £ 0, but we drop 
the second assumption in (6.7.1) that F contains a primitive n*” root of unity. Is it 
possible for the Galois group of E'/F to be cyclic? 

3. Let E be a splitting field for X” — a over F', where a 4 0, and assume that the 
characteristic of F does not divide n. Show that E contains a primitive n*” root of 
unity. 


We now assume that F is a splitting field for f(X) = X? — c over F, where c £ 0, p is 
prime and the characteristic of F is not p. Let w be a primitive p” root of unity in E (see 
Problem 3). Assume that f is not irreducible over F’, and let g be an irreducible factor 
of f of degree d, where 1 <d< p. Let 0 be a root of g in E. 


4, Let go be the product of the roots of g. (Since go is + the constant term of g, go € F.) 
Show that, = 0°? = c* 

5. Since d and p are relatively prime, there are integers a and b such that ad+ bp = 1. 
Use this to show that if X? — c is not irreducible over F’, then it must have a root 
in F. 
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6. Continuing Problem 5, show that if X? — c is not irreducible over F’, then E = F(w). 


7. Continuing Problem 6, show that if X? — c is not irreducible over F’, then X? —c 
splits over F if and only if F contains a primitive p“” root of unity. 


Let E'/F be a cyclic Galois extension of prime degree p, where p is the characteristic of F. 
Let o be a generator of G = Gal(E/F’). It is a consequence of Hilbert’s Theorem 90 (see 
the Problems for Section 7.3) that there is an element 6 € EF such that o(0) = 0+ 1. 
Prove the Artin-Schreier theorem: 


8. B= F(6). 
9. 0 is a root of f(X) = X? — X —a for some ae F. 
10. f is irreducible over F' (hence a # 0). 
Conversely, Let F' be a field of prime characteristic p, and let E be a splitting field for 
f(X) = X? — X — a, where a is a nonzero element of F’. 
11. If 6 is any root of f in E, show that E = F(0) and that f is separable. 


12. Show that every irreducible factor of f has the same degree d, where d = 1 or p. Thus 
if d=1, then E = F, and if d= p, then f is irreducible over F. 


13. If f is irreducible over F', show that the Galois group of f is cyclic of order p. 


6.8 Solvability By Radicals 


6.8.1 Definitions and Comments 


We wish to solve the polynomial equation f(X) = 0, f € F[X], under the restriction that 
we are only allowed to perform ordinary arithmetic operations (addition, subtraction, 
multiplication and division) on the coefficients, along with extraction of n‘” roots (for 
any n = 2,3,...). A sequence of operations of this type gives rise to a sequence of 
extensions 


F< F (04) = F (a4, 02) <3+ eS B01, 20507) = 2B 
where a}? € F and aj" € F(q,...,ai-1),t =2,...,r. Equivalently, we have 
P=Ffo<h<::-<F=L 


where F; = Fj_1(a;) and aj’ € Fi-1,i = 1,...,r. We say that E is a radical extension 
of F’. It is convenient (and legal) to assume that ny = --- = n, = n. (Replace each nj; 
by the product of all the n;. To justify this, observe that if a7 belongs to a field L, then 
a” € L,m = 2,3,....) Unless otherwise specified, we will make this assumption in all 
hypotheses, conclusions and proofs. 

We have already seen three explicit classes of radical extensions: cyclotomic, cyclic 
and Kummer. (In the latter two cases, we assume that the base field contains a primitive 
n'” root of unity.) 


24 CHAPTER 6. GALOIS THEORY 


We say that the polynomial f € F[X] is solvable by radicals if the roots of f lie in 
some radical extension of F’, in other words, there is a radical extension FE of F’ such 
that f splits over EF. 

Since radical extensions are formed by successively adjoining n*” roots, it follows that 
the transitivity property holds: If F is a radical extension of F' and L is a radical extension 
of E, then L is a radical extension of F’. 

A radical extension is always finite, but it need not be normal or separable. We 
will soon specialize to characteristic 0, which will force separability, and we can achieve 
normality by taking the normal closure (see (3.5.11)). 


6.8.2 Proposition 


Let E/F be a radical extension, and let N be the normal closure of E over F’. Then N/F 
is also a radical extension. 


Proof. E is obtained from F by successively adjoining a1,...,a;, where a; is the n‘” 


root of an element in F;-;. On the other hand, N is obtained from F’ by adjoining 
not only the a;, but their conjugates aj1,...,Qim(i). For any fixed i and j, there is an 
automorphism a € Gal(N/F) such that o(a;) = aj; (see (3.2.3), (3.5.5) and (3.5.6)). 
Thus 


ais = o(ai)" = o(a7) 


and since a? belongs to F(a1,...,ai;-1), it follows from (3.5.1) that o(a?) belongs to 
the splitting field K; of TTjaimin(a;, F ) over F. [Take K, = F, and note that since 
ay = b; € F, we have o(ai’) = o(b1) = b; € F. Alternatively, observe that by (3.5.1), ¢ 
must take a root of X” — b; to another root of this polynomial.] Thus we can display N 
as a radical extension of F' by successively adjoining 


O11,+++,A€1m(1)>+++>Arl,-++;Arm(r)- & 


6.8.3 Preparation for the Main Theorem 


If F has characteristic 0, then a primitive n*” root of unity w can be adjoined to F to 
reach an extension F'(w); see (6.5.1). If E is a radical extension of F and F = Fo < 
Fi, <.--< F, = E, we can replace F; by F;(w),i = 1,...,r, and E(w) will be a radical 
extension of F’. By (6.8.2), we can pass from E(w) to its normal closure over F’. Here is 
the statement we are driving at: 

Let f € F[X], where F has characteristic 0. If f is solvable by radicals, then there 
is a Galois radical extension N = F, >--- > F, > Fo = F containing a splitting field 
K for f over F, such that each intermediate field F;,72 = 1,...,r, contains a primitive 
n*” root of unity w. We can assume that F; = F(w) and for i > 1, F; is a splitting 
field for X” — b; over F;_,. [Look at the end of the proof of (6.8.2).] By (6.5.1), Fi/F 
is a cyclotomic (Galois) extension, and by (6.7.2), each Fj /F,-1,i = 2,...,r is a cyclic 
(Galois) extension. 
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We now do some further preparation. Suppose that K is a splitting field for f over F, 
and that the Galois group of K/F is solvable, with 


Gal(K/F) =H) > H,>---b H,=1 


with each H;_,/H; abelian. By the fundamental theorem (and Section 6.2, Problem 4), 
we have the corresponding sequence of fixed fields 


F=Ko< Ki <:::< Kp =K 


with K;/K;~1 Galois and Gal(K;/K;_1) isomorphic to Hj_,/H;. Let us adjoin a primitive 
n' root of unity w to each Kj, so that we have fields F; = K;(w) with 


PRP het ee 


We take n = | Gal(K/F)|. Since F; can be obtained from F;_; by adjoining everything 
in K; \ Ky-1, we have 


FP, = hi K; = Ki fii 


the composite of F;_,; and K;,i = 1,...,r. We may now apply Theorem 6.2.2. In the 
diamond diagram of Figure 6.2.1, at the top of the diamond we have F;, on the left K;,, 
on the right F;_1, and on the bottom K;M F;-1 2 Ki_1 (see Figure 6.8.1). We conclude 
that F;/F;-1 is Galois, with a Galois group isomorphic to a subgroup of Gal(K;/Kj_1). 
Since Gal(K;/K;i-1) © Hj-1/H;, it follows that Gal(F;/F;—1) is abelian. Moreover, the 
exponent of this Galois group divides the order of Hp, which coincides with the size of 
Gal(K/F). (This explains our choice of n.) 


ea 
Mis ae 


K,; NF 


Kj-1 


Figure 6.8.1 


6.8.4 Galois’ Solvability Theorem 


Let K be a splitting field for f over F', where F’ has characteristic 0. Then f is solvable 
by radicals if and only if the Galois group of K’/F is solvable. 
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Proof. If f is solvable by radicals, then as in (6.8.3), we have 
P=Fo<h<::-<F=N 


where N/F is Galois, N contains a splitting field K for f over F, and each F;/Fj_1 
is Galois with an abelian Galois group. By the fundamental theorem (and Section 6.2, 
Problem 4), the corresponding sequence of subgroups is 


1=H, < H,_-1 J --: Id Hp = G = Gal(N/F) 
with each H;_;/H; abelian. Thus G is solvable, and since 
Gal(K/F) = Gal(N/F)/Gal(N/K) 
[map Gal(.N/F') — Gal(K/F) by restriction; the kernel is Gal(N/K)], Gal(K/F) is solv- 
able by (5.7.4). 


Conversely, assume that Gal(K/F’) is solvable. Again as in (6.8.3), we have 


Pe SSR 


where K < F,, each F; contains a primitive n“” root of unity, with n = | Gal(K/F)|, 
and Gal(F;/Fj-1) is abelian with exponent dividing n for all i = 1,...,r. Thus each 
F,/F;-, is a Kummer extension whose Galois group has an exponent dividing n. By 
(6.7.5) (or (6.5.1) for the case i = 1), each F;/F;_, is a radical extension. By transitivity 
(see (6.8.1)), F; is a radical extension of F’. Since Kk C F;, f is solvable by radicals. & 


6.8.5 Example 


Let f(X) = X° — 10X* + 2 over the rationals. The Galois group of f is Ss, which is not 
solvable. (See Section 6.6, Problem 3 and Section 5.7, Problem 5.) Thus f is not solvable 
by radicals. 

There is a fundamental idea that needs to be emphasized. The significance of Galois’ 
solvability theorem is not simply that there are some examples of bad polynomials. The 
key point is there is no general method for solving a polynomial equation over the rationals 
by radicals, if the degree of the polynomial is 5 or more. If there were such a method, 
then in particular it would work on Example (6.8.5), a contradiction. 


Problems For Section 6.8 


In the exercises, we will sketch another classical problem, that of constructions with ruler 
and compass. In Euclidean geometry, we start with two points (0,0) and (1,0), and we 
are allowed the following constructions. 


(i) Given two points P and Q, we can draw a line joining them; 


) 
(ii) Given a point P and a line L, we can draw a line through P parallel to L; 
(iii) Given a point P and a line L, we can draw a line through P perpendicular to L; 
) 


(iv) Given two points P and Q, we can draw a circle with center at P passing through Q; 
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(v) Let A, and similarly B, be a line or a circle. We can generate new points, called 
constructible points, by forming the intersection of A and B. If (c,0) (equivalently 
(0,c)) is a constructible point, we call c a constructible number. It follows from (ii) 
and (iii) that (a,b) is a constructible point iff a and 6 are constructible numbers. It 
can be shown that every rational number is constructible, and that the constructible 
numbers form a field. Now in (v), the intersection of A and B can be found by 
ordinary arithmetic plus at worst the extraction of a square root. Conversely, the 
square roof of any nonnegative constructible number can be constructed. Therefore 
c is constructible iff there are real fields Q = Fo < F,--- < F, such that c € F, and 
each [F; : F;_;] is 1 or 2. Thus if ¢ is constructible, then c is algebraic over Q and 
[Q(c) : Q| is a power of 2. 


1. (Trisecting the angle) If it is possible to trisect any angle with ruler and compass, then 
in particular a 60 degree angle can be trisected, so that a = cos 20° is constructible. 
Using the identity 


2" — cog 34 + isin30 = (cos @ + isin @)%, 


reach a contradiction. 

2. (Duplicating the cube) Show that it is impossible to construct, with ruler and compass, 
a cube whose volume is exactly 2. (The side of such a cube would be W2.) 

3. (Squaring the circle) Show that if it were possible to construct a square with area 7, 
then 7 would be algebraic over Q. (It is known that 7 is transcendental over Q.) 

To construct a regular n-gon, that is, a regular polygon with n sides, n > 3,we must 
be able to construct an angle of 27/n; equivalently, cos 27/n must be a constructible 
number. Let w = e’2/”, a primitive n*” root of unity. 

4. Show that [Q(w) : Q(cos 27/n)] = 2. 

5. Show that if a regular n-gon is constructible, then the Euler phi function y(n) is a 
power of 2. 

Conversely, assume that y(n) is a power of 2. 

6. Show that Gal(Q(cos 27/n)/Q) is a 2-group, that is, a p-group with p = 2. 

7. By Section 5.7, Problem 7, every nontrivial finite p-group has a subnormal series in 
which every factor has order p. Use this (with p = 2) to show that a regular n-gon is 
constructible. 

8. {From the preceding, a regular n-gon is constructible if and only if y(n) is a power 
of 2. Show that an equivalent condition is that n = 2°q,---q@,s,¢t =0,1,..., where 
the q; are distinct Fermat primes, that is, primes of the form 2” + 1 for some positive 
integer m. 

9. Show that if 2” + 1 is prime, then m must be a power of 2. The only known Fermat 
primes have m = 2%, where a = 0,1,2,3,4 (2°? +1 is divisible by 641). [The key 
point is that if a is odd, then X + 1 divides X* + 1 in Z[X]; the quotient is X*~+ — 
X*74.--.—X +1 (since a — 1 is even).] 

Let F be the field of rational functions in n variables e€),...,e, over a field kK with 
characteristic 0, and let f(X) =X" —e,X"~! + e.X"-? —---+ (-1)"en € F[X]. If 
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Q1,...,Qn, are the roots of f in a splitting field over F’, then the e; are the elementary 
symmetric functions of the a;. Let E = F(a,...,@,), so that E/F is a Galois 
extension and G = Gal(E/F’) is the Galois group of f. 


10. Show that G = S,. 


11. What can you conclude from Problem 10 about solvability of equations? 


6.9 ‘Transcendental Extensions 


6.9.1 Definitions and Comments 


An extension F/F such that at least one a € EF is not algebraic over F' is said to be 
transcendental. An idea analogous to that of a basis of an arbitrary vector space V turns 
out to be profitable in studying transcendental extensions. A basis for V is a subset of V 
that is linearly independent and spans V. A key result, whose proof involves the Steinitz 
exchange, is that if {v1,...,%m} spans V and S is a linearly independent subset of V, 
then |S| < m. We are going to replace linear independence by algebraic independence 
and spanning by algebraic spanning. We will find that every transcendental extension has 
a transcendence basis, and that any two transcendence bases for a given extension have 
the same cardinality. All these terms will be defined shortly. The presentation in the 
text will be quite informal; I believe that this style best highlights the strong connection 
between linear and algebraic independence. An indication of how to formalize the devel- 
opment is given in a sequence of exercises. See also Morandi, “Fields and Galois Theory”, 
pp. 173-182. 

Let E/F be an extension. The elements t1,...,t, € EF are algebraically dependent 
over F (or the set {ti,...,t,} is algebraically dependent over F’) if there is a nonzero 
polynomial f € F[X1,...,X»,] such that f(t1,...,t,) = 0; otherwise the ¢; are alge- 
braically independent over F'. Algebraic independence of an infinite set means algebraic 
independence of every finite subset. 

Now if a set TJ spans a vector space V, then each x in V is a linear combination 
of elements of T, so that « depends on T in a linear fashion. Replacing “linear” by 
“algebraic”, we say that the element t € E depends algebraically on T over F if t is 
algebraic over F(T), the field generated by T over F' (see Section 3.1, Problem 1). We 
say that T spans EF algebraically over F if each t in E depends algebraically on T over F, 
that is, E is an algebraic extension of F(T). A transcendence basis for E'/F is a subset 
of E that is algebraically independent over F' and spans E algebraically over F. (From 
now on, we will frequently regard F as fixed and drop the phrase “over F”.) 


6.9.2 Lemma 

If S is a subset of EF, the following conditions are equivalent. 
(i) S is a transcendence basis for E'/F; 

(ii) S is a maximal algebraically independent set; 


(iii) S is a minimal algebraically spanning set. 
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Thus by (ii), S is a transcendence basis for E'/F iff S is algebraically independent and E 
is algebraic over F(S). 


Proof. (i) implies (ii): If S C T where T is algebraically independent, let u € T \ S. 
Then u cannot depend on S algebraically (by algebraic independence of T), so S cannot 
span F algebraically. 

(ii) implies (i): If S does not span E algebraically, then there exists u € E such 
that u does not depend algebraically on S. But then SU {wu} is algebraically independent, 
contradicting maximality of S. 

(i) implies (iii): If T C S and T spans E algebraically, let u € S\T. Then u depends 
algebraically on T, so TU {u}, hence S, is algebraically dependent, a contradiction. 

(iii) implies (i): If S is algebraically dependent, then some u € S' depends algebraically 
on T= S\ {u}. But then T spans F algebraically, a contradiction. d& 


6.9.3. Proposition 


Every transcendental extension has a transcendence basis. 


Proof. The standard argument via Zorn’s lemma that an arbitrary vector space has a 
maximal linearly independent set (hence a basis) shows that an arbitrary transcendental 
extension has a maximal algebraically independent set, which is a transcendence basis 
by (6.9.2). & 


For completeness, if H/F is an algebraic extension, we can regard @ as a transcendence 
basis. 


6.9.4 The Steinitz Exchange 


If {21,...,%m} spans FE algebraically and S C E is algebraically independent, then 
|S] <m. 


Proof. Suppose that S$ has at least m+ 1 elements y1,...,Ym41- Since the x; span E 
algebraically, y; depends algebraically on 21,...,2%m. The algebraic dependence relation 
must involve at least one x;, say 21. (Otherwise, S would be algebraically dependent.) 
Then x; depends algebraically on y1, v2,...,@m, 80 {y1,%2,---,Um} spans E algebraically. 
We claim that for every i = 1,...,m, {y1,.--, Yi; Vit1,---;@m} spans F algebraically. We 
have just proved the case 7 = 1. If the result holds for 7, then y;; depends algebraically on 
{Y1, +++; Yi Lit1,--+,;Lm}, and the dependence relation must involve at least one z,, say 
2441 for convenience. (Otherwise, S would be algebraically dependent.) Then 2;4, de- 
pends algebraically on y1,..., Yit1, Li-2,---;2m, 80 {Y1,---, Yit1, Vid2,---;%m} spans EF 
algebraically, completing the induction. 

Since there are more y’s than z’s, eventually the x’s disappear, and y1,...,Ym span E 
algebraically. But then y,,41 depends algebraically on y1,..., Ym, contradicting the alge- 
braic independence of S. & 
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6.9.5 Corollary 


Let S and T be transcendence bases of FE. Then either S and T are both finite or they 
are both infinite; in the former case, || = |T]. 


Proof. Assume that one of the transcendence bases, say T, is finite. By (6.9.4), |S] < |T], 
so S is finite also. By a symmetrical argument, |T| < |S|, so |S|=|T|. & 


6.9.6 Proposition 


If S and T are arbitrary transcendence bases for FE, then |.S| = |T|. [The common value 
is called the transcendence degree of E'/F.] 


Proof. By (6.9.5), we may assume that S and T are both infinite. Let T = {y;: i € I}. 
If « € S, then x depends algebraically on finitely many elements y;,,...,y;, in T. Define 
I(x) to be the set of indices {i1,...,%,}. It follows that J = U{I(x): « € S}. For if 9 
belongs to none of the I(x), then we can remove y; from T and the resulting set will still 
span E algebraically, contradicting (6.9.2) part (iii). Now an element of U{I(a): « € S} 
is determined by selecting an element x € S and then choosing an index in I(x). Since 
I(«) is finite, we have |I(a)| < No. Thus 


IZ] =|{1@): # € SH < |SIXo = [S| 


since S' is infinite. Thus |T'| < |S|. By symmetry, |S| = |T|. & 


6.9.7 Example 


Let E = F(X,...,Xn) be the field of rational functions in the variables X,..., Xn 
with coefficients in F. If f(X1,...,Xn) = 0, then f is the zero polynomial, so S = 
{X1,..., Xn} is an algebraically independent set. Since E = F(S), E is algebraic over 
F(S) and therefore S spans F algebraically. Thus S is a transcendence basis. 

Now let T = {Xj",..., Xf}, where wi,...,Un are arbitrary positive integers. We 
claim that T is also a transcendence basis. As above, T is algebraically independent. 
Moreover, each X; is algebraic over F(T). To see what is going on, look at a concrete 
example, say T = {X}, X3, X$}. If f(Z) = 73 — X$ € F(T)[Z], then Xo is a root of f, so 
X», and similarly each X;, is algebraic over F(T). By (3.3.3), E is algebraic over F(T), 
so T is a transcendence basis. 


Problems For Section 6.9 
1. If S is an algebraically independent subset of EF over F’', T spans EF algebraically over F, 
and S C T, show that there is a transcendence basis B such that S C BCT. 


2. Show that every algebraically independent set can be extended to a transcendence 
basis, and that every algebraically spanning set contains a transcendence basis. 


3. Prove carefully, for an extension E/F and a subset T = {t,...,tn} C E, that the 
following conditions are equivalent. 
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(i) T is algebraically independent over F;; 
(ii) For every i= 1,...,n, t; is transcendental over F(T \ {t:}); 
(iii) For every i = 1,...,n, t; is transcendental over F'(t,,...,t;-1) (where the state- 


ment for 7 = 1 is that t is transcendental over F’). 


4, Let S be a subset of E that is algebraically independent over F’. Show that ift €¢ E\S, 
then ¢ is transcendental over FS) if and only if SU {t} is algebraically independent 
over F’. 


[Problems 3 and 4 suggest the reasoning that is involved in formalizing the results of this 
section. ] 


5. Let F < K < E, with S$ a subset of K that is algebraically independent over F, and T 
a subset of F that is algebraically independent over K. Show that SUT is algebraically 
independent over F, and SAT = 9. 


6. Let F < K < FE, with S a transcendence basis for K/F and T a transcendence basis 
for E/K. Show that SUT is a transcendence basis for E/F’. Thus if tr deg abbreviates 
transcendence degree, then by Problem 5, 


trdeg(E£/F) = trdeg(K/F) + trdeg(E£/K). 


7. Let E be an extension of F, and T = {t1,...,tn} a finite subset of E. Show that 
F(T) is F-isomorphic to the rational function field F(X1,..., Xp) if and only if T is 
algebraically independent over F’. 


8. An algebraic function field F in one variable over K is a field F/K such that there 
exists x € F transcendental over K with [F': K(x)| < oo. If z € F, show that z is 
transcendental over K iff [F : K(z)] < co. 


9. Find the transcendence degree of the complex field over the rationals. 


Appendix To Chapter 6 


We will develop a method for calculating the discriminant of a polynomial and apply the 
result to a cubic. We then calculate the Galois group of an arbitrary quartic. 


A6.1 Definition 


If 21,...,@n (n > 2) are arbitrary elements of a field, the Vandermonde determinant of 
the Vi is 
1 1 1 
Ly v2 rn 
det V = 
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A6.2 Proposition 


i<j 


Proof. det V is a polynomial h of degree 1+2+---+ (n—1) = (3) in the variables 
£1,...,2n, a is g = [|], (x; — a). If a; = x; for i < j, then the determinant is 0, so 
by the remainder theorem (2.5.2), each factor of g, hence g itself, divides h. Since h and 
g have the same degree, h = cg for some constant c. Now look at the leading terms of h 
and g, i.e., those terms in which x, appears to as high a power as possible, and subject 
to this constraint, x, _, appears to as high a power as possible, etc. In both cases, the 
leading term is x27} ---a?~1, and therefore c must be 1. (For this step it is profitable to 
regard the «; as abstract variables in a polynomial ring. Then monomials x7" --- a7" with 
different sequences (71,...,1n) of exponents are linearly independent.) & 


A6.3 Corollary 


If f is a polynomial in F[X] with roots 21,...,2, in some splitting field over F, then the 
discriminant of f is (det V)?. 


Proof. By definition of the discriminant D of f (see 6.6.1), we have D = A? where 
=+detV. & 


A6.4 Computation of the Discriminant 


The square of the determinant of V is det(VV“), which is the determinant of 


n-1 
1 1 1 1 2 vy : 
ry r In 1 2 tn 
n-1 n-1 n-l1 n-l1 
xy x5 Hy 1 ey De 
and this in turn is 
to ty ct ty-a 
ty 5: tn 
tn—1 tn aes ton—2 


where the power sums t, are given by 
n 
to=n, oa r> 1. 
i=1 


We must express the power sums in terms of the coefficients of the polynomial f. This 
will involve, improbably, an exercise in differential calculus. We have 
n n 
F(z) = [[c —4j4z) = Saaz with cp = 1; 


i=l i=0 
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the variable z ranges over real numbers. Take the logarithmic derivative of F' to obtain 


F'(z) a8 nies: —2j 225g ys 
F(z) dz 1 — 442 : Fn 


i=1 i=1 j=0 


Thus 
co 
F'(z) + F(z) >_ tyy1z7 =0, 
j=0 
that is, 
S- icje’ | + S- Gz" by 7 ea 
i=1 i=0 j=l 


Equating powers of z’~!, we have, assuming that n > r, 
Cr + Coty + Cytp_y +--+ + Cp_ity = 0; (1) 
if r > n, the first summation does not contribute, and we get 
tr + Citp_y tess + Cytr_n = 0. (2) 


Our situation is a bit awkward here because the roots of F(z) are the reciprocals of the 2;. 
The 2; are the roots of )>"_9 ajz' where a; = Cn—; (so that a, = co = 1). The results can 
be expressed as follows. 


A6.5 Newton’s Identities 


If f(X) = oy, aX" (with a, = 1) is a polynomial with roots 71,...,2n, then the power 
sums t, satisfy 


tp + An—itp—1 + + Gn—rgiti + an—r = 0, ron (3) 
and 


tp + Qn—itr-1 +++: + dotrpen =0, r>n. (4) 


A6.6 The Discriminant of a Cubic 


First consider the case where the X? term is missing, so that f(X) = X°+pX +q. Then 
n = to = 3,da0 = ¢, 41 = p, dg = 0 (ag = 1). Newton’s identities yield 

ty Tag = 0,t1 = 0; to + agt, + 2a4 = 0, te = —2p; 

t3 ic agate ale a,ty bas 3a0 0, ts —34ado —3q; 

t4 Tr dats Tr ayte aa aot, = 0, t4 = —p(—2p) = 2p? 


34 CHAPTER 6. GALOIS THEORY 


We now go to the general case f(X) = X23 + aX? +bX +c. The quadratic term can be 
eliminated by the substitution Y = X + 3. Then 


f(x) =9Y) = (Y + a(Y 5) + OCs =) te 
_ pY 2 2a? ba 

y3 h ig a - 

eon ge 3 ve 27 3 ? 


Since the roots of f are translations of the roots of g by the same constant, the two 
polynomials have the same discriminant. Thus D = —4p* — 27q?, which simplifies to 


D = a?(b? — 4ac) — 4b° — 27c? + 18abe. 


We now consider the Galois group of a quartic X* + aX? + bX? + cX +d, assumed 
irreducible and separable over a field F. As above, the translation Y = X + 4 eliminates 
the cubic term without changing the Galois group, so we may assume that f(X) = 
X*++4qX?+7rX +s. Let the roots of f be 21,22,273,%4 (distinct by separability), and 
let V be the four group, realized as the subgroup of $4 containing the permutations 
(1, 2)(3,4), (1,3)(2, 4) and (1,4)(2,3), along with the identity. By direct verification (i.e., 
brute force), V < $4. If G is the Galois group of f (regarded as a group of permutations 
of the roots), then V MG <1 G by the second isomorphism theorem. 


A6.7 Lemma 
F(V NG) = F(u,v, w), where 


u= (%1+%2)(%3 +24), v= (21+ %3)(%2+ 24), w= (x14 £4)(%2 +23). 


Proof. Any permutation in V fixes u,v and w, so GF(u,v,w) D2 VNG. Ifo eG 
but o € VNG then (again by direct verification) o moves at least one of u,v,w. For 
example, (1,2,3) sends u to w, and (1,2) sends v to w. Thus o ¢ GF(u,v,w). Therefore 
GF(u,v,w) = VG, and an application of the fixed field operator F completes the 
proof. @& 


A6.8 Definition 


The resolvent cubic of f(X) = X*4+ qX?+rX 4+ 8 is g(X) =(X —u)(X -—v)(X —w). 

To compute g, we must express its coefficients in terms of q,r and s. First note that 
u-—v = —(41 — @4)(t2g— 23), U- w = —(@1 — #3)(XQ — 24), U-— WwW = —(41 — X2)(43 — L4). 
Thus f and g have the same discriminant. Now 


X44 gX*47X +65 (X7+kX +1 (X?-—kX +m) 


where the appearance of k and —k is explained by the missing cubic term. Equating 
coefficients gives 1+ m—k? = q,k(m—1) =1,lm = s. Solving the first two equations for 
m and adding, we have 2m = k? + q+ 1r/k, and solving the first two equations for | and 
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adding, we get 21 = k? + q—r/k. Multiply the last two equations and use lm = s to get 
a cubic in k?, namely 


k8 + Qgk* + (q? — 4s)k? — r? =0. 


(This gives a method for actually finding the roots of a quartic.) To summarize, 
f(X) = (X? + kX +1)(X? — kX +m) 

where k? is a root of 
A(X) = X34 2X? + (q? — 4s) X — 1’. 


We claim that the roots of h are simply —u, —v,—w. For if we arrange the roots of f so 
that 21 and x2 are the roots of X?-+kX +1, and x3 and x4 are the roots of X?-kX +m, 
then k = —(21 + %2),-k = —(a3 + 24), so —u = k?. The argument for —v and —w 
is similar. Therefore to get g from h, we simply change the sign of the quadratic and 
constant terms, and leave the linear term alone. 


A6.9 An Explicit Formula For The Resolvent Cubic: 


g(X) = X? — 2gX? + (q? —48)X +1’. 


We need some results concerning subgroups of S,, n > 3. 


A6.10 Lemma 
(i) An is generated by 3-cycles, and every 3-cycle is a commutator. 
(ii) The only subgroup of S;, with index 2 is Ap. 


Proof. For the first assertion of (i), see Section 5.6, Problem 4. For the second assertion 
of (i), note that 


(a, b)(a, c)(a, b)~'(a,c)~' = (a,b) (a, )(a, b)(a, c) = (a,b,c). 


To prove (ii), let H be a subgroup of S,, with index 2; H is normal by Section 1.3, 
Problem 6. Thus S;,/H has order 2, hence is abelian. But then by (5.7.2), part 5, 
S!' < H, and since A, also has index 2, the same argument gives Si, < A,. By (i), 
A, < Si, so A, = S), < H. Since A, and H have the same finite number of elements 
n!/2, it follows that H= A,. & 


A6.11 Proposition 


Let G be a subgroup of S,4 whose order is a multiple of 4, and let V be the four group 
(see the discussion preceding A6.7). Let m be the order of the quotient group G/(GNV). 
Then 


(a) If m=6, then G = Sy; 
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(b) If m = 38, then G= Ay; 

(c) Ifm=1, then G=V; 

(d) If m = 2, then G = Dg or Z4 or V; 

(e) If G acts transitively on {1,2,3,4}, then the case G = V is excluded in (d). [In all 


cases, equality is up to isomorphism.] 


Proof. If m = 6 or 3, then since |G| = m|GNV\, 3 is a divisor of |G]. By hypothesis, 4 is 
also a divisor, so |G| is a multiple of 12. By A6.10 part (ii), G must be S4 or Ay. But 


|S4/(SaV)| = |S4/V| = 24/4 = 6 
and 
|Ag/(Aa NV)| = |Aa/V| = 12/4 = 3 


proving both (a) and (b). If m = 1, then G = GNV, so G < V, and since |G| is a multiple 
of 4 and |V| = 4, we have G = V, proving (c). 

If m = 2, then |G| = 2|GN V, and since |V| = 4, |GNV| is 1, 2 or 4. If it is 1, 
then |G| = 2 x 1 = 2, contradicting the hypothesis. If it is 2, then |G| = 2 x 2 = 4, and 
G = Z, or V (the only groups of order 4). Finally, assume |GNV| = 4, so |G| = 8. But a 
subgroup of $4 of order 8 is a Sylow 2-subgroup, and all such subgroups are conjugate and 
therefore isomorphic. One of these subgroups is Dg, since the dihedral group of order 8 
is a group of permutations of the 4 vertices of a square. This proves (d). 

If m = 2, G acts transitively on {1,2,3,4} and |G| = 4, then by the orbit-stabilizer 
theorem, each stabilizer subgroup G(z) is trivial (since there is only one orbit, and its size 
is 4). Thus every permutation in G except the identity moves every integer 1,2,3,4. Since 
IGNV]| = 2,G consists of the identity, one other element of V, and two elements not in V, 
which must be 4-cycles. But a 4-cycle has order 4, so G must be cyclic, proving (e). & 


A6.12 Theorem 
Let f be an irreducible separable quartic, with Galois group G. Let m be the order of 


the Galois group of the resolvent cubic. Then: 

(a) If m=6, then G = Sq; 

(b) If m = 3, then G= Ay; 

(c) Ifm=1, then G=V; 
) 


If m = 2 and f is irreducible over L = F(u,v,w), where u,v and w are the roots of 
he resolvent cubic, then G = Dg; 


t 
(e) If m= 2 and f is reducible over L, then G = Z4. 


Proof. By A6.7 and the fundamental theorem, [G : GN V] = [L: F]. Now the roots of 
the resolvent cubic g are distinct, since f and g have the same discriminant. Thus L is 
a splitting field of a separable polynomial, so L/F is Galois. Consequently, [L : F'] =m 
by (3.5.9). To apply (A6.11), we must verify that |G| is a multiple of 4. But this follows 
from the orbit-stabilizer theorem: since G acts transitively on the roots of f, there is only 


6.9. TRANSCENDENTAL EXTENSIONS 37 


one orbit, of size 4 = |G|/|G(a)|. Now (A6.11) yields (a), (b) and (c), and if m = 2, then 
G= Dg or Z4. 

To complete the proof, assume that m = 2 and G = Dg. Thinking of Dg as the 
group of symmetries of a square with vertices 1,2,3,4, we can take Dg to be generated by 
(1,2,3,4) and (2,4), with V = {1, (1, 2)(3,4), (1,3)(2,4), (1,4)(2,3)}. The clements of V 
are symmetries of the square, hence belong to Dg; thus V = GNV = Gal(E/L) by (A6.7). 
[E is a splitting field for f over F’.] Since V is transitive, for each 1,7 = 1,2,3,4, 74 J, 
there is an L-automorphism 7 of E such that r(z;) = 2;. Applying 7 to the equation 
h(a;) = 0, where h is the minimal polynomial of x; over L, we see that each 2; is a root 
of h, and therefore f | h. But h | f by minimality of h, so h = f, proving that f is 
irreducible over L. 

Finally, assume m = 2 and G = Zy,, which we take as {1,(1,2,3,4), (1,3)(2, 4), 
(1,4,3,2)}. Then GOV = {1,(1,3)(2,4)}, which is not transitive. Thus for some i ¥ j, 
x; and x; are not roots of the same irreducible polynomial over L. In particular, f is 
reducible over L. & 


A6.13 Example 


Let f(X) = X+4+3X242X +1 over Q, with gq =3,r = 2,8 = 1. The resolvent cubic is, 
by (A6.9), g(X) = X°-6X?45X +44. To calculate the discriminant of g, we can use the 
general formula in (A6.6), or compute g(X + 2) = (X + 2)? —6(X +2)? +5(X+2)4+4= 
X3—7X —2. [The rational root test gives irreducibility of g and restricts a factorization 
of f to (X? +aX +1)(X? —aX +1), a € Z, which is impossible. Thus f is irreducible 
as well.| We have D(g) = —4(—7)? — 27(—2)? = 1264, which is not a square in Q. Thus 
m = 6, so the Galois group of f is S4. 
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Introducing Algebraic Number 
Theory 


(Commutative Algebra 1) 


The general theory of commutative rings is known as commutative algebra. The main 
applications of this discipline are to algebraic number theory, to be discussed in this 
chapter, and algebraic geometry, to be introduced in Chapter 8. 
Techniques of abstract algebra have been applied to problems in number theory for 
a long time, notably in the effort to prove Fermat’s Last Theorem. As an introductory 
example, we will sketch a problem for which an algebraic approach works very well. If p 
is an odd prime and p = 1 mod 4, we will prove that p is the sum of two squares, that is, 
p can be expressed as x? + y? where x and y are integers. Since pot is even, it follows 
that -1 is a quadratic residue (that is, a square) mod p. [Pair each of the numbers 2,3, 
. ,p — 2 with its multiplicative inverse mod p and pair 1 with p— 1 = —1 mod p. The 
product of the numbers 1 through p — 1 is, mod p, 
= ee 
2 


1x2x---x en ee ee 


and therefore [(25+)!]? = —-1 mod p]] 

If —1 = 2? mod p, then p divides x? + 1. Now we enter the ring of Gaussian integers 
and factor x? + 1 as (2+ 1%)(a— i). Since p can divide neither factor, it follows that p is 
not prime in Z[i], so we can write p = a where neither a nor @ is a unit. 

Define the norm of y = a+ bi as N(y) = a? +67. Then N(y) = 1 iff y = +1 or +i 
iff y is a unit. (See Section 2.1, Problem 5.) Thus 


p’ = N(p) = N(a)N(8) with N(a) > 1 and N() > 1, 
so N(a) = N(8) =p. Ifa=a+ iy, then p= 2? + y?. 


1 
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Conversely, if p is an odd prime and p = «7+ y?, then p is congruent to 1 mod 4. (If x 
is even, then x? = 0 mod 4, and if x is odd, then 2? = 1 mod 4. We cannot have x and y 
both even or both odd, since p is odd.) 

It is natural to conjecture that we can identify those primes that can be represented as 
x? + |dly?, where d is a negative integer, by working in the ring Z[Vd]. But the Gaussian 
integers (d = —1) form a Euclidean domain, in particular a unique factorization domain. 
On the other hand, unique factorization fails for d < —3 (Section 2.7, Problem 7), so the 
above argument collapses. [Recall from (2.6.4) that in a UFD, an element p that is not 
prime must be reducible.] Difficulties of this sort led Kummer to invent “ideal numbers” , 
which later became ideals at the hands of Dedekind. We will see that although a ring of 
algebraic integers need not be a UFD, unique factorization of ideals will always hold. 


7.1 Integral Extensions 


If E/F is a field extension and a € FE, then a is algebraic over F iff a is a root of a 
polynomial with coefficients in F’. We can assume if we like that the polynomial is monic, 
and this turns out to be crucial in generalizing the idea to ring extensions. 


7.1.1 Definitions and Comments 


In this chapter, unless otherwise specified, all rings are assumed commutative. Let A be 
a subring of the ring R, and let x € R. We say that x is integral over A if x is a root of a 
monic polynomial f with coefficients in A. The equation f(X) = 0 is called an equation 
of integral dependence for x over A. If x is a real or complex number that is integral 
over Z, then « is called an algebraic integer. Thus for every integer d, Vd is an algebraic 
integer, as is any n*” root of unity. (The monic polynomials are, respectively, X? — d 
and X" — 1.) In preparation for the next result on conditions equivalent to integrality, 
note that Ala], the set of polynomials in x with coefficients in A, is an A-module. (The 
sum of two polynomials is a polynomial, and multiplying a polynomial by a member of A 
produces another polynomial over A.) 


7.1.2 Proposition 


Let A be a subring of R, with x € R. The following conditions are equivalent: 


(i) x is integral over A; 
(ii) The A-module Als] is finitely generated; 


(iii) x belongs to a subring B of R such that A C B and B is a finitely generated A- 
module. 


Proof. (i) implies (ii). If x is a root of a monic polynomial over A of degree n, then x” 
and all higher powers of x can be expressed as linear combinations of lower powers of z. 
Thus 1,2,2?,,...,2"~1 generate A[z] over A. 

(ii) implies (iii). Take B = A[z]. 
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(iii) implies (i). If G1,...,8n generate B over A, then xG; is a linear combination of 
the 3;, say 73; = ae ci; 3;. Thus if 3 is a column vector whose components are the /3;, 
I is an n by n identity matrix, and C = [c,,], then 


(aI —C)G=0, 
and if we premultiply by the adjoint matrix of «J — C (as in Cramer’s rule), we get 
[det(al — C)|IG6 =0, 


hence [det(aJI — C)|b = 0 for every b € B. Since B is a ring we may set b = 1 and conclude 
that x is a root of the monic polynomial det(XJ —C) in A[X]. & 


For other equivalent conditions, see Problems 1 and 2. 
We are going to prove a transitivity property for integral extensions (analogous to 
(3.3.5)), and the following result will be helpful. 


7.1.3 Lemma 


Let A be a subring of R, with 21,...,¢%, € R. If x1 is integral over A, x2 is integral 
over A[x,], ..., and x, is integral over Ala,,...,%p—1], then Ala,,...,2,] is a finitely 
generated A-module. 


Proof. The n = 1 case follows from (7.1.2), part (ii). Going from n — 1 to n amounts 
to proving that if A, B and C are rings, with C a finitely generated B-module and Ba 
finitely generated A-module, then C is a finitely generated A-module. This follows by a 
brief computation: 


C=) > By, B=) Am so C=S YS) Aye. & 
j=l k=1 


j=l k=1 


7.1.4 Transitivity of Integral Extensions 


Let A, B and C be subrings of R. If C is integral over B, that is, each element of C is 
integral over B, and B is integral over A, then C is integral over A. 


Proof. Let x € C, with 2” + b,_ya"~1 +-+-+b 2+ bp = 0, b; € B. Then z is integral 
over Albo,..-,0n—1]. Each 6; is integral over A, hence over A[bo,...,b;-1]. By (7.1.3), 
Albo,...,0n—1,2] is a finitely generated A-module. By (7.1.2), part (iii), x is integral 
over A. & 


7.1.5 Definitions and Comments 


If A is a subring of R, the integral closure of A in R is the set A, of elements of R that 
are integral over A. Note that A C A, because each a € A is a root of X —a. We say that 
A is integrally closed in Rif A, = A. If we simply say that A is integrally closed without 
reference to R, we assume that A is an integral domain with quotient field K, and A is 
integrally closed in K. 
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If a and y are integral over A, then just as in the proof of (7.1.4), it follows from 
(7.1.3) that A[zx,y] is a finitely generated A-module. Since x + y,x — y and zy belong to 
this module, they are integral over A by (7.1.2) part (iii). The important conclusion is 
that 


A¢ is a subring of R containing A. 


If we take the integral closure of the integral closure, we get nothing new. 


7.1.6 Proposition 
The integral closure A, of A in R is integrally closed in R. 


Proof. By definition, A, is integral over A. If x is integral over A,, then as in the proof 
of (7.1.4), x is integral over A, so that r€ Ac. & 


We can identify a large class of integrally closed rings. 


7.1.7 Proposition 
If A is a UFD, then A is integrally closed. 


Proof. If x belongs to the quotient field K, then we can write x = a/b where a,b € A, 
with a and 6 relatively prime. If x is integral over A, then there is an equation of the form 


(a/b)” + dn—1(a/b)"—+ eee cal ay(a/b) +aj =0 


with a; € A. Multiplying by 6”, we have a” + be = 0, with c € A. Thus 6 divides a”, 
which cannot happen for relatively prime a and 6 unless b has no prime factors at all, in 
other words, b is a unit. But then =ab-!C A. & 


We can now discuss one of the standard setups for doing algebraic number theory. 


7.1.8 Definitions and Comments 


A number field is a subfield L of the complex numbers C such that L is a finite extension 
of the rationals Q. Thus the elements of L are algebraic numbers. The integral closure 
of Z in L is called the ring of algebraic integers (or simply integers) of L. In the next 
section, we will find the algebraic integers explicitly when LD is a quadratic extension. 


Problems For Section 7.1 


1. Show that in (7.1.2) another equivalent condition is the following: 
(iv) There is a subring B of R such that B is a finitely generated A-module and 
tBCesB. 
If R is a field, show that the assumption that B is a subring can be dropped (as long 
as B #0). 
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2. A module is said to be faithful if its annihilator is 0. Show that in (7.1.2) the following 
is another equivalent condition: 


(v) There is a faithful A[x]-module B that is finitely generated as an A-module. 


Let A be a subring of the integral domain B, with B integral over A. In Problems 3-5 
we are going to show that A is a field if and only if B is a field. 


3. Assume that B is a field, and let a be a nonzero element of A. Then since a~! € B, 
there is an equation of the form 


(a-4y" + aay free cia} + Co = 0 


with c; € A. Show that a~! € A, proving that A is a field. 


4. Now assume that A is a field, and let b be a nonzero element of B. By (7.1.2) part (ii), 
Alb] is a finite-dimensional vector space over A. Let f be the A-linear transformation 
on this vector space given by multiplication by b, in other words, f(z) = bz,z € Alb]. 
Show that f is injective. 


5. Show that f is surjective as well, and conclude that B is a field. 


In Problems 6-8, let A be a subring of B, with B integral over A. Let Q be a prime 
ideal of B and let P= QN A. 


6. Show that P is a prime ideal of A, and that A/P can be regarded as a subring of B/Q. 
7. Show that B/Q is integral over A/P. 


8. Show that P is a maximal ideal of A if and only if Q is a maximal ideal of B. 


7.2 Quadratic Extensions of the Rationals 


We will determine the algebraic integers of L = Q(Vd), where d is a square-free integer 
(a product of distinct primes). The restriction on d involves no loss of generality; for 
example, Q(V12) = Q(V3). The minimal polynomial of Vd over Q is X? — d, which has 
roots +d. The extension L/Q is Galois, and the Galois group consists of the identity 
and the automorphism o(a+ bVd) = a— bd, a,b€ Q. 

A remark on notation: To make sure that there is no confusion between algebraic 
integers and ordinary integers, we will use the term rational integer for a member of Z. 


7.2.1 Lemma 


If a and b are rational numbers, then a+ bV4d is an algebraic integer if and only if 2a and 
a” — db’ belong to Z. In this case, 2b is also in Z. 


Proof. Let « = a+ bVd, so that o(a) = a—bVd. Then x + a(x) = 2a € Q and 
xo(xz) = a* — db* € Q. Now if z is an algebraic integer, then x is a root of a monic 
polynomial f € Z[X]. But f(o(x)) = o(f(x)) since o is an automorphism, so a(x) is 
also a root of f and hence an algebraic integer. By (7.1.5), 2a and a? — db? are also 
algebraic integers, as well as rational numbers. By (7.1.7), Z is integrally closed, so 2a 
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and a? — db? belong to Z. The converse holds because a+ bVd is a root of (X — a)? = db?, 
ie., X? —2aX +a? — db? =0. 

Now if 2a and a? — db? are rational integers, then (2a)? — d(2b)? = 4(a? — db?) € Z, so 
d(2b)? € Z. If 2b ¢ Z, then its denominator would include a prime factor p , which would 
appear as p” in the denominator of (2b)?. Multiplication of (2b)? by d cannot cancel the 
p* because d is square-free, and the result follows. & 


7.2.2 Corollary 
The set B of algebraic integers of Q(V4d), d square-free, can be described as follows. 


(i) Ifd#1 mod 4, then B consists of all a + bVd, a,b € Z; 


(ii) If d= 1 mod 4, then B consists of all $ + $Vd, u,v € Z, where u and v have the 
same parity (both even or both odd). 


[Note that since d is square-free, it is not divisible by 4, so the condition in (i) is d = 2 or 
3 mod 4,] 


Proof. By (7.2.1), the algebraic integers are of the form 4 + g/d where u,v € Z and 
Z - ae € Z, ie., u? — dv? = 0 mod 4. It follows that u and v have the same parity. 
(The square of an even number is congruent to 0 and the square of an odd number to 1 
mod 4.) Moreover, the “both odd” case can only occur when d = 1 mod 4. The “both 


even” case is equivalent to 5,5 € Z, and the result follows. & 


We can express these results in a more convenient form. We will show in (7.4.10) 
that the set B of algebraic integers in any number field L is a free Z-module of rank 
n= [L:Q|. A basis for this module is called an integral basis or Z-basis for B. 


7.2.3 Theorem 

Let B be the algebraic integers of Q(Vd), d square-free. 

(i) If d#1 mod 4, then 1 and Vd form an integral basis of B; 

(ii) If d= 1 mod 4, then 1 and $(1+ Vd) form an integral basis. 

Proof. (i) By (7.2.2), 1 and Vd span B over Z, and they are linearly independent because 
Jd is irrational. 


(ii) By (7.2.2), 1 and $(1+ Vd) are algebraic integers. To show that they span B, 
consider $(u + vVd), where u and v have the same parity. Then 


sutovd) = (4F*) a) +0 /sa+va| 


with 4; and v in Z. Finally, to show linear independence, assume that a,b € Z and 


a+o[5a+va| =0. 


Then 2a +b +bVd = 0, which forceesa=b=0. & 


7.3. NORMS AND TRACES il 


Problems For Section 7.2 


1. Let L = Q(a), where a is a root of the irreducible quadratic X?+bX +c, with b,c € Q. 
Show that D = Q(v 4d) for some square-free integer d. Thus the analysis of this section 
covers all possible quadratic extensions of Q. 


2. Show that the quadratic extensions Q(Vd), d square-free, are all distinct. 


3. Continuing Problem 2, show that in fact no two distinct quadratic extensions of Q are 
Q-isomorphic. 


Cyclotomic fields do not exhibit the same behavior. Let w, = e'27/", a primitive n*” root 
of unity. By a direct computation, we have w3,, = wn, and 


aor _ —eit(n+1)/n = e'™ eT pin/n = Won: 


4. Show that if n is odd, then Q(wn) = Q(wan). 

5. If x is an algebraic integer, show that the minimal polynomial of x over Q has coeffi- 
cients in Z. (This will be a consequence of the general theory to be developed in this 
chapter, but it is accessible now without heavy machinery.) Consequently, an algebraic 
integer that belongs to Q in fact belongs to Z. (The minimal polynomial of r € Q over 
Qis X —r.) 

6. Give an example of a quadratic extension of Q that is also a cyclotomic extension. 


7. Show that an integral basis for the ring of algebraic integers of a number field L is, in 
particular, a basis for D over Q. 


7.3. Norms and Traces 


7.3.1 Definitions and Comments 


If E/F is a field extension of finite degree n, then in particular, E is an n-dimensional 
vector space over F’, and the machinery of basic linear algebra becomes available. If x is 
any element of E', we can study the F-linear transformation m(x) given by multiplication 
by az, that is, m(a)y = xy. We define the norm and the trace of z, relative to the extension 
E/F, as 
N[E/F]\(x) = det m(x) and T|E/F|(x) = trace m(2). 
We will write N(x) and T(z) if E/F is understood. If the matrix A(x) = [a;;(x)] repre- 
sents m(x) with respect to some basis for E over F’, then the norm of x is the determinant 
of A(a) and the trace of x is the trace of A(x), that is, the sum of the main diagonal 
entries. The characteristic polynomial of x is defined as the characteristic polynomial of 
the matrix A(x), that is, 
char[E'/F](x) = det[XI — A(x)| 


where J is an n by n identity matrix. If E/F is understood, we will refer to the charac- 
teristic polynomial of x, written char(z). 
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7.3.2 Example 
Let E = Cand F =R. A basis for C over R is {1,72} and, with 7 = a + bi, we have 
(a+ bi)(1) = a(1) + 0(2) and (a + bi)(z) = —b(1) + a(t). 


Thus 


A(a+ bi) = E | 


The norm, trace and characteristic polynomial of a+ bi are 
N(a+ bi) =a? +0", T(a+ bi) = 2a, char(a+ bi) = X? — 2aX +0740. 


The computation is exactly the same if F = Q(i) and F = Q. Notice that the coefficient 
of X is minus the trace and the constant term is the norm. In general, it follows from the 
definition of characteristic polynomial that 


char(z) = X" —T(x)X" 1 4+---+(-1)"N(za). 
[The only terms multiplying X”~! in the expansion of the determinant are —a;;(z), 
i=1,...,n. Set X =0 to show that the constant term of char(x) is (—1)” det A(z). 


7.3.3 Lemma 


If & is an extension of F and x € E, then N(x), T(x) and the coefficients of char(zx) 
belong to F. Ifa € F, then 


N(a) =a", T(a) = na, and char(a) = (X — a)”. 


Proof. The first assertion follows because the entries of the matrix A(x) are in F. The 
second statement holds because if a € F, the matrix representing multiplication by a 
isal. & 


It is natural to look for a connection between the characteristic polynomial of « and 
the minimal polynomial of x over F’. 


7.3.4 Proposition 


char[E/F](x) = [min(a, F’)]" 
where r = [E: F(2)]. 


Proof. First assume that r = 1, so that E = F(x). By the Cayley-Hamilton theorem, 
the linear transformation m(a) satisfies char(«), and since m(x) is multiplication by z, 
x itself is a root of char(x). Thus min(x, F) divides char(x). But both polynomials 
have degree n, and the result follows. In the general case, let y,,...,ys; be a basis for 
F(a) over F, and let z1,...,2- be a basis for E over F(x). Then the y,z; form a basis 
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for E over F. Let A = A(x) be the matrix representing multiplication by x in the 
extension F(x)/F', so that ry; = >>, deiye, and x(yiz;) = 2, Gei(yrz;). Order the basis 
for E/F as y121, Y2o215 ++ +5 YsZ13 Y1225 Y2225 ++ Ys223 00-5 YLSr5 Y22ry +++ Ys2r- Then m(x) 
is represented in E’/F as 


AO... 0 
0 A... O 
0 0... A 
Thus char[E/F](a) = [det(XI — A)]", which by the r = 1 case coincides with 


{min(a,F)|". & 


7.3.5 Corollary 


Let [BE : F] =n, and [F(x): F] =d. Let 7,...,2q be the roots of min(x, F) in a splitting 
field (counting multiplicity). Then 


and 


Proof. The formula for the characteristic polynomial follows from (7.3.4). The norm is 
(—1)” times the constant term of char(a) (see (7.3.2)), hence is 


d n/d 
(-1)"(-1)" (11 «| | 


Finally, if min(«, F) = X¢+ ag_,X*!4+---+a,X +ap, then the coefficient of X"~! in 
[min(a, F)|"/? is aq-1=—-4 ey x;. Since the trace is the negative of this coefficient 
[see (7.3.2)], the result follows. d& 


If E is a separable extension of F’, there are very useful alternative expressions for the 
trace and norm. 


7.3.6 Proposition 


Let E/F be a separable extension of degree n, and let o1,...,0n be the distinct F- 
monomorphisms of F into an algebraic closure of EF, or equally well into a normal extension 
L of F containing &. Then 


n 


T[E/F\(x) =~ o;(x) and N[E/F\(2) = II o;(2). 


i=l 
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Consequently, T(ax + by) = aT (x) + bT(y) and N(ay) = N(x)N(y) for 2, y € E,a,be F. 


Proof. Each of the d distinct F- embeddings 7, of F(x) into L takes x into a unique 
conjugate x;, and extends to exactly 4 = [E : F(«x)] F-embeddings of FE into L, all of 
which also take x to x; [see (3.5.1), (3.2.3) and (3 5.2)]. Thus 


and 


by (7.3.5). & 


The linearity of T and the multiplicativity of N hold without any assumption of 
separability, since in (7.3.1) we have m(axz + by) = am(x) + bm(y) and m(xy) = m(ax) o 
m(y). 


7.3.7 Corollary (Transitivity of Trace and Norm) 
If F< K < E, where E/F is finite and separable, then 
T|E/F] =T|K/F] 0 T[E/K] and N[E/F] = N[K/F] 0 N[E/K]. 


Proof. Let o1,...,0n be the distinct F-embeddings of K into LD, and let 71,..., 7m be the 
distinct K-embeddings of E into L, where L is the normal closure of E over F’. By (6.3.1) 
and (3.5.11), L/F is Galois, and by (3.5.2), (3.5.5) and (3.5.6), each mapping o; and 7; 
extends to an automorphism of L. Therefore it makes sense to allow the mappings to be 
composed. By (7.3.6), 


n m n 


T[K/F\(T[E/K])(2) = do oi( > 74(@)) = S2o eta(a) 


i=1 j=l i=1 j=1 


Now each o;7; is an F-embedding of EF into L, and the number of mappings is 
mn = [E: K\[K : F] = [(E: F]. Furthermore, the o;7; are distinct when restricted 
to E. For if o;7; = opm on E, hence on K, then o; = o, on K (because 7; = 7] = 
the identity on K). Thus i = k, so that 7; = 7 on E. But then j = 1. By (7.3.6), 
T|K/F\(T|E/K])(«) = T[E/F](x). The norm is handled the same way, with sums re- 
placed by products. 


7.3.8 Corollary 


If E'/F is a finite separable extension, then T|E/F](x) cannot be 0 for all x € E. 


Proof. If T(x) = 0 for all x, then by (7.3.6), }°_, oi(x) = 0 for all x. This contradicts 
Dedekind’s lemma (6.1.6). & 
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A statement equivalent to (7.3.8) is that if E/F is finite and separable, then the “trace 
form” [the bilinear form (x,y) — T|E/F|(xy)] is nondegenerate, i.e., if T(vy) = 0 for all 
y, then x = 0. For if ¢ 4 0,T(2o) 4 0, and T(xy) = 0 for all y, choose y so that xy = xo 
to reach a contradiction. 


7.3.9 The Basic Setup For Algebraic Number Theory 


Let A be an integral domain with quotient field kK, and let L be a finite separable extension 
of kK. Let B be the set of elements of LZ that are integral over A, that is, B is the integral 
closure of A in L. The diagram below summarizes all the information. 


L — B 


In the most important special case, A = Z, K = Q, L is a number field, and B is the ring 
of algebraic integers of L. From now on, we will refer to (7.3.9) as the AK LB setup. 


7.3.10 Proposition 


If « € B, then the coefficients of char[L/K](x) and min(«, K) are integral over A. In 
particular, T|L/K](x) and N[L/K](x) are integral over A, by (7.3.2). If A is integrally 
closed, then by (7.3.3), the coefficients belong to A. 


Proof. The coefficients of min(xz, K) are sums of products of the roots x;, so by (7.1.5) 
and (7.3.4), it suffices to show that the x; are integral over A. Each x; is a conjugate of x 
over K, so by (3.2.3) there is a K-isomorphism 7;: K(a) — K(a;) such that 7;(”) = 2. 
If we apply 7; to an equation of integral dependence for x over A, we get an equation of 
integral dependence for x; over A. @& 


Problems For Section 7.3 


1fE= Q(Vd) and « =a+bV/d€ E, find the norm and trace of z. 


2. If E = Q(A) where @ is a root of the irreducible cubic X° — 3X + 1, find the norm 
and trace of 6?. 


3. Find the trace of the primitive 6” root of unity w in the cyclotomic extension Qe. 
We will now prove Hilbert’s Theorem 90: If E/F is a cyclic extension with [E : F] =n 
and Galois group G = {1,¢,...,0"~1} generated by o, and x € E, then 
(i) N(x) = 1 if and only if there exists y € E such that « = y/a(y); 
(ii) T(x) = 0 if and only if there exists z € EF such that x = z — o(z). 
4. Prove the “if” parts of (i) and (ii) by direct computation. 


7! are linearly independent over E, so 


By Dedekind’s lemma, 1,¢,07,...,0 
1+a0+4 2o(2)o? +---+ 20(2)---0"?(x)o"" 


is not identically 0 on EF. 
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5. Use this to prove the “only if” part of (i). 
By (7.3.8), there is an element u € EF whose trace is not 0. Let 


w = xo(u) + (x + o(x))o?(u) + +--+ (w@+o(2) ++» +a" (a))o"™ (u) 
hence 
o(w) = o(x)o*(u) + (a(x) + 07(x))o°(u) +--+ + (a(x) + 07(2) +++» +" *(x))o"(u) 
6. If T(x) = 0, show that w — o(w) = «T(u). 
7. If z=w/T(u), show that z— o(z) = a, proving the “only if” part of (ii). 
8. In Hilbert’s Theorem 90, are the elements y and z unique? 
9. Let 6 be a root of X*— 2 over Q. Find the trace over Q of 0, 67, 6? and 30. 


10. Continuing Problem 9, show that 3 cannot belong to Q{6]. 


7.4 The Discriminant 


We have met the discriminant of a polynomial in connection with Galois theory (Sec- 
tion 6.6). There is also a discriminant in algebraic number theory. The two concepts are 
unrelated at first glance, but there is a connection between them. We assume the basic 
AKLB setup of (7.3.9), with n = [L: K]. 


7.4.1 Definition 
The discriminant of the n-tuple x = (x1,...,%n) of elements of L is 
D(x) = det(T[L/K](a:2;)). 
Thus we form a matrix whose 7j element is the trace of x;x;, and take the determinant 
of the matrix. By (7.3.3), D(x) belongs to K. If the x; are in B, then by (7.3.10), D(x) 


is integral over A, hence belongs to A if A is integrally closed. 
The discriminant behaves quite reasonably under linear transformation: 


7.4.2 Lemma 


If y = Ca, where C is an n by n matrix over kK and « and y are n-tuples written as 
column vectors, then D(y) = (det C)?D(z). 


Proof. The trace of y,yz is 


T( xs Critajizs) = s Cri LT (LiL; ) C85 
tJ 


hence 
(T(yrys)) = C(L (aia) )C 


where C” is the transpose of C. The result follows upon taking determinants. d& 
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Here is an alternative expression for the discriminant. 


7.4.3 Lemma 


Let o1,...,0n be the K-embeddings of L into an algebraic closure of L, as in (7.3.6). 
Then D(x) = [det(o;(x;))]?. 

Thus we form the matrix whose ij element is o;(x,;), take the determinant and square 
the result. 


Proof. By (7.3.6), 


Tae) = Son eas) =>: one on(a;) 


k k 
so if C is the matrix whose ij entry is o;(x,;), then 
(Tee;))= CC 
and again the result follows upon taking determinants. d& 


The discriminant “discriminates” between bases and non-bases, as follows. 


7.4.4 Proposition 
If ~ = (@1,...,@n), then the x; form a basis for L over K if and only if D(x) 40. 


Proof. If >), cj; = 0, with the c; € K and not all 0, then 7, cjo;(2j) = 0 for all 2, so the 
columns of the matrix B = (0;(x;)) are linearly dependent. Thus linear dependence of 
the x; implies that D = 0. Conversely, assume that the x; are linearly independent (and 
therefore a basis since n = [L : K]). If D = 0, then the rows of B are linearly dependent, 
so for some c; € K, not all 0, we have 5°; c;o;(x;) = 0 for all j. Since the x; form a basis, 
we have )°, co;(u) = 0 for all u € L, so the monomorphisms o; are linearly dependent. 
This contradicts Dedekind’s lemma. & 


We now make the connection between the discriminant defined above and the discrim- 
inant of a polynomial defined previously. 


7.4.5 Proposition 


Assume that L = K(x), and let f be the minimal polynomial of « over K. Let D be the 
discriminant of the basis 1,z,z?,...,2"~' for L over K. Then D is the discriminant of 
the polynomial f. 


Proof. Let x1,...,@% be the roots of f in a splitting field, with x; = x. Let a; be the 
K-embedding that takes x to xj, i =1,...,n. Then o;(2/) = x1, so by (7.4.3), D is the 
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square of the determinant of the matrix 


Lay ee as ae 
-1 

1 ty: 08 ie 
2 n—-1 

Le gt BES ek SLE 


and the result follows from the formula for a Vandermonde determinant; see (A6.2). & 


7.4.6 Corollary 
Under the hypothesis of (7.4.5), 


where f’ is the derivative of f. 


Proof. Let a= 1), By (7.4.5), 
D= [[@ —a2;= a] I(x: —2;)= al [fj — %;). 
i<j tAj a j#i 
But f(X) = (X — 21)---(X —2z,), so 
f'(w) = >> [](* - 25) 
k Xk 


with X replaced by x;. When X is replaced by x;, only the k = 7 term is nonzero, hence 


j#t 
Consequently, 
D=a Il f' (a) 
i=1 
But 
f'(ai) = f'(oi(z)) = oi f(x) 
so by (7.3.6), 


The discriminant of an integral basis for a number field has special properties. We 
will get at these results by considering the general AK LB setup, adding some additional 
conditions as we go along. 
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7.4.7 Lemma 


There is a basis for L/K consisting entirely of elements of B. 


Proof. Let 21,...,%n be a basis for ZL over K. Each x; is algebraic over K, and therefore 
satisfies a polynomial equation of the form 


Om UX,” + +++ + a,4; + a9 = 0 


with am #0 and the a; € A. (Initially, we only have a; € K, but then a; is the ratio of 
two elements in A, and we can form a common denominator.) Multiply the equation by 
a™~1 to obtain an equation of integral dependence for y; = ax; over A. The y; form 
the desired basis. & 


7.4.8 Theorem 


Suppose we have a nondegenerate symmetric bilinear form on an n-dimensional vector 
space V, written for convenience using inner product notation (2, y). If a1,...,@n is any 
basis for V, then there is a basis y1,..., Yn for V, called the dual basis referred to V, such 
that 


1, t=9; 
5,43) = Os3 = 
a {i iA ij. 
This is a standard (and quite instructive) result in linear algebra, and it will be de- 
veloped in the exercises. 


7.4.9 Theorem 


If A is a principal ideal domain, then B is a free A-module of rank n. 


Proof. By (7.3.8), the trace is a nondegenerate symmetric bilinear form on the 
n-dimensional vector space L over K. By (7.1.7), A is integrally closed, so by (7.3.10), 
the trace of any element of L belongs to A. Now let 21,...,%, be any basis for L over K 
consisting of algebraic integers, and let y1,...,Yn be the dual basis referred to L (see 
(7.4.8)). If z € B, then we can write z = )°/_, ajy; with a; ¢ K. We know that the 
trace of x;z belongs to A, and we also have 


T(a;2z) = 203) ajiy;) = S- ajT (xiy;) = ye 05045 = aj. 
g=1 j=l j=l 


Thus each a; belongs to A, so that B is an A-submodule of the free A-module @7_, Ayj. 
By (4.6.2), B is a free A-module of rank at most n. But by (7.4.7), B contains a basis for 
L over K, and if we wish, we can assume that this basis is 71,...,%,. Then B contains 
the free A-module 67_, Ax;, so the rank of B as an A-module is at least n, and hence 
exactly n. de 
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7.4.10 Corollary 


The set B of algebraic integers in any number field L is a free Z-module of rank n = [L : Ql). 
Therefore B has an integral basis. The discriminant is the same for every integral basis; 
it is known as the field discriminant. 


Proof. Take A = Z in (7.4.9) to show that B has an integral basis. The transformation 
matrix C between two integral bases (see (7.4.2)) is invertible, and both C and C7! have 
rational integer coefficients. Take determinants in the equation CC~! = I to conclude 
that det C is a unit in Z. Therefore det C = +1, so by (7.4.2), all integral bases have the 
same discriminant. @& 


Problems For Section 7.4 


Let 21,...,2n be a basis for the vector space V, and let (x,y) be a nondegenerate sym- 
metric bilinear form on V. We now supply the details of the proof of (7.4.8). 


1. For any y € V, the mapping x — (a, y) is a linear form [(y), i-e., a linear map from 
V to the field of scalars. Show that the linear transformation y — I(y) from V to V%*, 
the dual space of V (i.e., the space of all linear forms on V), is injective. 

2. Show that any linear form on V is I(y) for some y € V. 

3. Let fi,..-, fn be the dual basis corresponding to 71,...,%,. Thus each f; belongs to 
V* (not V) and f;(2;) = 6;;. If f; = l(y;), show that y1,...,yn is the required dual 
basis referred to V. 

4, Show that x; = )°"_,(«i,vj)yj. Thus in order to compute the dual basis referred to 
V in terms of the original basis, we must invert the matrix ((x;,7;)). 


5. A matrix C with coefficients in Z is said to be unimodular if its determinant is +1. 
Show that C' is unimodular if and only if C is invertible and its inverse has coefficients 
in Z. 

6. Show that the field discriminant of the quadratic extension Q(v4d), d square-free, is 


D- 4d if d#1 mod 4; 
~ )d ifd=1 mod 4. 


7. Let %1,...,% be arbitrary algebraic integers in a number field, and consider the 
determinant of the matrix (0;(x;)), as in (7.4.3). The direct expansion of the deter- 
minant has n! terms. Let P be the sum of those terms in the expansion that have 
plus signs in front of them, and N the sum of those terms prefixed by minus signs. 
Thus the discriminant D of (x1,...,@n) is (P — N)?. Show that P+ N and PN are 
fixed by each o;, and deduce that P+ N and PN are rational numbers. 


8. Continuing Problem 7, show that P+ N and PN are rational integers. 
9. Continuing Problem 8, prove Stickelberger’s theorem: D = 0 or 1 mod 4. 


10. Let Z be a number field of degree n over Q, and let y1,...,Y, be a basis for L over Q 
consisting of algebraic integers. Let x1,...,2» be an integral basis. Show that if the 
discriminant D(y1,...,Yn) is square-free, then each x; can be expressed as a linear 
combination of the y; with integer coefficients. 
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11. Continuing Problem 10, show that if D(yi,...,Yn) is square-free, then y1,...,Yn is 
an integral basis. 


12. Is the converse of the result of Problem 11 true? 
13. In the standard AK LB setup (see (7.3.9)), show that L is the quotient field of B. 


7.5 Noetherian and Artinian Modules and Rings 


7.5.1 Definitions and Comments 


In this section, rings are not assumed commutative. Let M be an R-module, and suppose 
that we have an increasing sequence of submodules M, < My < M3 <..., or a decreasing 
sequence M, > My > M3 > .... We say that the sequence stabilizes if for some tf, 
M, = Misi = Miyo2 = .... The question of stabilization of sequences of submodules 
appears in a fundamental way in many areas of abstract algebra and its applications. We 
have already made contact with the idea; see (2.6.6) and the introductory remarks in 
Section 4.6. 

The module M is said to satisfy the ascending chain condition (acc) if every increasing 
sequence of submodules stabilizes; M satisfies the descending chain condition (dcc) if every 
decreasing sequence of submodules stabilizes. 


7.5.2 Proposition 


The following conditions on an R-module M are equivalent, and define a Noetherian 
module: 


(1) M satisfies the acc; 


(2) Every nonempty collection of submodules of M has a maximal element (with respect 
to inclusion). 


The following conditions on M are equivalent, and define an Artinian module: 


(1’) M satisfies the dcc; 


(2’) Every nonempty collection of submodules of M has a minimal element. 


Proof. Assume (1), and let S be a nonempty collection of submodules. Choose Mj € S. 
If MM, is maximal, we are finished; otherwise we have M, < Mz for some Mz € S. If we 
continue inductively, the process must terminate at a maximal element; otherwise the acc 
would be violated. 

Conversely, assume (2), and let M,; < Mz < .... The sequence must stabilize; oth- 
erwise {M,, M>,...} would be a nonempty collection of submodules with no maximal 
element. The proof is exactly the same in the Artinian case, with all inequalities re- 
versed. & 


There is another equivalent condition in the Noetherian case. 
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7.5.3 Proposition 
M is Noetherian iff every submodule of / is finitely generated. 


Proof. If the sequence My, < Mz <... does not stabilize, let N =US2,M,. Then N isa 


submodule of M, and it cannot be finitely generated. For if x,,...,7, generate N, then 
for sufficiently large t, all the x; belong to M;. But then N C M; C Mi4, C-:- CN, 
so Mt = Mi4, = .... Conversely, assume that the acc holds, and let N < M. If 


N # 0, choose 71 € N. If Ra, = N, then N is finitely generated. Otherwise, there 
exists 22 ¢ Ra,. If x1 and x2 generate N, we are finished. Otherwise, there exists 
v3 ¢ Ra, + Rag. The acc forces the process to terminate at some stage t, in which case 
%1,--.-,% generate N. & 


The analogous equivalent condition in the Artinian case (see Problem 8) is that every 
quotient module M/N is finitely cogenerated, that is, if the intersection of a collection of 
submodules of M//N is 0, then there is a finite subcollection whose intersection is 0. 


7.5.4 Definitions and Comments 


A ring R is Noetherian [resp. Artinian] if it is Noetherian [resp. Artinian] as a module 
over itself. If we need to distinguish between R as a left, as opposed to right, R-module, 
we will refer to a left Noetherian and a right Noetherian ring, and similarly for Artinian 
rings. This problem will not arise until Chapter 9. 


7.5.5 Examples 
1. Every PID is Noetherian. 
This follows from (7.5.3), since every ideal is generated by a single element. 


2. Z is Noetherian (a special case of Example 1) but not Artinian. There are many 
descending chains of ideals that do not stabilize, e.g., 


ZD (2) D (4) D (8) D.... 


We will prove in Chapter 9 that an Artinian ring must also be Noetherian. 


3. If F is a field, then the polynomial ring F[X] is Noetherian (another special case of 
Example 1) but not Artinian. A descending chain of ideals that does not stabilize is 


CXS Sale) cas 


4. The ring F[X1, X2,...] of polynomials over F' in infinitely many variables is neither 
Artinian nor Noetherian. A descending chain of ideals that does not stabilize is con- 
structed as in Example 3, and an ascending chain of ideals that does not stabilize 
is 


(Xi) Cc (X1, X2) Cc (X1, Xo, X3) Cu... 
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7.5.6 Remark 


The following observations will be useful in deriving properties of Noetherian and Artinian 
modules. If N < M, then a submodule L of M that contains N can always be written in 
the form kK +N for some submodule Kk. (AK = L is one possibility.) By the correspondence 
theorem, 


(K, + N)/N = (Ko+N)/N implies K, + N = Kp +N and 
(Ki + N)/N < (Ko + N)/N implies Ay tN < Kot N. 


7.5.7 Proposition 


If N is a submodule of M, then M is Noetherian [resp. Artinian] if and only if N and 
M/N are Noetherian [resp. Artinian]. 


Proof. Assume M is Noetherian. Then N is Noetherian by (2) of (7.5.2), since a submod- 
ule of N must also be a submodule of M. By (7.5.6), an ascending chain of submodules 
of M/N looks like (Mi + N)/N < (M2+N)/N <.---. But then the M; + N form an as- 
cending sequence of submodules of M, which must stabilize. Consequently, the sequence 
(M; + N)/N,i =1,2,..., must stabilize. 

Conversely, assume that N and M/N are Noetherian, and let M, < Mz <--- be 
an increasing sequence of submodules of M. Take 7 large enough so that both sequences 
{M; N N} and {M; +N} have stabilized. If x € Mist, then +N € Misi +N=M;4+N, 
sox =y+z where y € M; and ze€ N. Thusr—y € M41 NN = M,N, and since 
y © M; we have x € M; as well. Consequently, M; = Mj+1 and the sequence of M;’s 
has stabilized. The Artinian case is handled by reversing inequalities (and interchanging 
indices ¢ and i+ 1 in the second half of the proof). @& 


7.5.8 Corollary 
If M,,...,M, are Noetherian [resp. Artinian] R-modules, then so is M; @M2@---® Mp. 


Proof. It suffices to consider n = 2 (induction will take care of higher values of n). The 
submodule N = M, of M = M; © Mz is Noetherian by hypothesis, and M/N 2 Mo 
is also Noetherian (apply the first isomorphism theorem to the natural projection of M 
onto M2). By (7.5.7), M is Noetherian. The Artinian case is done the same way. d& 


7.5.9 Corollary 


If M is a finitely generated module over the Noetherian [resp. Artinian] ring R, then M 
is Noetherian |resp. Artinian] 


Proof. By (4.3.6), M is a quotient of a free module L of finite rank. Since L is the direct 
sum of a finite number of copies of R, the result follows from (7.5.8) and (7.5.7). & 


Ascending and descending chains of submodules are reminiscent of normal and sub- 
normal series in group theory, and in fact we can make a precise connection. 
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7.5.10 Definitions 
A series of length n for a module M is a sequence of the form 
M=Mj)>M,2>::->M, =0. 


The series is called a composition series if each factor module M;/M;4, is simple. [A 
module is simple if it is nonzero and has no submodules except itself and 0. We will study 
simple modules in detail in Chapter 9.) Thus we are requiring the series to have no proper 
refinement. Two series are equivalent if they have the same length and the same factor 
modules, up to isomorphism and rearrangement. 

By convention, the zero module has a composition series, namely {0} itself. 


7.5.11 Jordan-Holder Theorem For Modules 


If M has a composition series, then any two composition series for M are equivalent. Fur- 
thermore, any strictly decreasing sequence of submodules can be refined to a composition 
series. 


Proof. The development of the Jordan-Hélder theorem for groups can be taken over ver- 
batim if we change multiplicative to additive notation. In particular, we can reproduce the 
preliminary lemma (5.6.2), the Zassenhaus lemma (5.6.3), the Schreier refinement theo- 
rem (5.6.5), and the Jordan-Hélder Theorem (5.6.6). We need not worry about normality 
of subgroups because in an abelian group, all subgroups are normal. As an example of 
the change in notation, the Zassenhaus lemma becomes 


A+(BND) _ C+(DNB) 
A+(BNC) C+(DNA)’ 


This type of proof can be irritating, because it forces readers to look at the earlier de- 
velopment and make sure that everything does carry over. A possible question is “Why 
can’t a composition series S of length n coexist with an infinite ascending or descending 
chain?” But if such a situation occurs, we can form a series T for M of length n+1. By 
Schreier, S and 7 have equivalent refinements. Since S has no proper refinements, and 
equivalent refinement have the same length, we have n > +1, a contradiction. & 


We can now relate the ascending and descending chain conditions to composition 
series. 


7.5.12 Theorem 


The R-module M has a composition series if and only if M is both Noetherian and 
Artinian. 


Proof. The “only if” part was just done at the end of the proof of (7.5.11). Thus as- 
sume that M is Noetherian and Artinian. Assuming (without loss of generality) that 
M #0, it follows from (2) of (7.5.2) that My = M has a maximal proper submodule M,. 
Now M, is Noetherian by (7.5.7), so if M, 4 0, then M; has a maximal proper submodule 
Mz. Continuing inductively, we must reach 0 at some point because M is Artinian. By 
construction, each M;/M;+1 is simple, and we have a composition series for M. & 
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Here is a connection with algebraic number theory. 


7.5.13 Proposition 


In the basic AK LB setup of (7.3.9), assume that A is integrally closed. If A is a Noetherian 
ring, then so is B. In particular, the ring of algebraic integers in a number field is 
Noetherian. 


Proof. By the proof of (7.4.9), B is a submodule of a free A-module M of finite rank. 
(The assumption that A is a PID in (7.4.9) is used to show that A is integrally closed, 
and we have this by hypothesis. The PID assumption is also used to show that B is 
a free A-module, but we do not need this in the present argument.) By (7.5.8), M is 
Noetherian, so by (7.5.7), B is a Noetherian A-module. An ideal of B is, in particular, 
an A-submodule of B, hence is finitely generated over A and therefore over B. Thus B is 
a Noetherian ring. & 


Problems For Section 7.5 


1. Let p be a fixed prime, and let A be the abelian group of all rational numbers a/p”, 
n=0,1,...,a € Z, where all calculations are modulo 1, in other words, A is a subgroup 
of Q/Z. Let A, be the subgroup {0,1/p",2/p”,...,(p" — 1)/p"}. Show that A is not 
a Noetherian Z-module. 


2. Continuing Problem 1, if B is a proper subgroup of A, show that B must be one of 
the A,. Thus A is an Artinian Z-module. [This situation cannot arise for rings, where 
Artinian implies Noetherian.] 

3. If V is a vector space, show that V is finite-dimensional iff V is Noetherian iff V is 
Artinian iff V has a composition series. 


4. Define the length of a module M [notation I(M/)] as the length of a composition series 
for M. (If M has no composition series, take 1(M/) = co.) Suppose that we have a 
short exact sequence 


0 N= Se 


Show that /(M) is finite if and only if [(.N)) and 1(M/N) are both finite. 
5. Show that | is additive, that is, 


i(M) =1(N) + U(M/N). 


6. Let S be a subring of the ring R, and assume that S is a Noetherian ring. If R is 
finitely generated as a module over S, show that FR is also a Noetherian ring. 

7. Let R be a ring, and assume that the polynomial ring R[X] is Noetherian. Does it 
follow that R is Noetherian? 

8. Show that a module M is Artinian if and only if every quotient module M/N is finitely 
cogenerated. 
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7.6 Fractional Ideals 


Our goal is to establish unique factorization of ideals in a Dedekind domain, and to do 
this we will need to generalize the notion of ideal. First, some preliminaries. 


7.6.1 Definition 


If ),...,J, are ideals, the product I, --- I, is the set of all finite sums $0, a1;a2;--- ani, 
where az; € Ip, k= 1,...,n. It follows from the definition that the product is an ideal 
contained in each J;. 


7.6.2 Lemma 


If P is a prime ideal that contains a product [,---J, of ideals, then P contains J; for 
some j. 


Proof. If not, let aj € I; \P, j =1,...,n. Then a;---a, belongs to ,---I, C P, and 
since P is prime, some a; belongs to P, a contradiction. & 


7.6.3 Proposition 


If I is a nonzero ideal of the Noetherian integral domain R, then J contains a product of 
nonzero prime ideals. 


Proof. Assume the contrary. If S is the collection of all nonzero ideals that do not contain 
a product of nonzero prime ideals, then since R is Noetherian, S has a maximal element J, 
and J cannot be prime because it belongs to S. Thus there are elements a,b € R with 
ag J,b¢ J, and abé€ J. By maximality of J, the ideals J+ Ra and J+ Rb each contain 
a product of nonzero prime ideals, hence so does (J + Ra)(J + Rb) C J+ Rab = J. This 
is a contradiction. [Notice that we must use the fact that a product of nonzero ideals is 
nonzero, and this is where the hypothesis that R is an integral domain comes in.] & 


7.6.4 Corollary 


If I is an ideal of the Noetherian ring R (not necessarily an integral domain), then I 
contains a product of prime ideals. 


Proof. Repeat the proof of (7.6.3) with the word “nonzero” deleted. d& 


Ideals in the ring of integers are of the form nZ, the set of multiples of n. A set of the 
form 3Z, is not an ideal because it is not a subset of Z, yet it behaves in a similar manner. 
The set is closed under addition and multiplication by an integer, and it becomes an ideal 
of Z if we simply multiply all the elements by 2. It will be profitable to study sets of this 


type. 
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7.6.5 Definitions 


Let R be an integral domain, with K its quotient field, and let J be an R-submodule 
of K. We say that I is a fractional ideal of R if rl C R for some nonzero r € R. We will 
call r a denominator of I. An ordinary ideal of R is a fractional ideal (take r = 1), and 
will often be referred to an as integral ideal. 


7.6.6 Lemma 


(i) If I is a finitely generated R-submodule of K, then I is a fractional ideal. 


(ii) If R is Noetherian and J is a fractional ideal of R, then J is a finitely generated 
R-submodule of K. 

(iii) If J and J are fractional ideals with denominators r and s respectively, then IN J, 
I+ J and IJ are fractional ideals with respective denominators r (or s), rs and rs. 
[The product of fractional ideals is defined exactly as in (7.6.1).] 


Proof. (i) If a1 = a1/b1,...,2n = Gn/bn generate I and b = by --- by, then bI C R. 

(ii) If rf C R, then J C r~!R. As an R-module, r~'R is isomorphic to R and is 
therefore Noetherian. Consequently, I is finitely generated. 

(iii) It follows from the definition (7.6.5) that the intersection, sum and product of 
fractional ideals are fractional ideals. The assertions about denominators are proved by 
noting that r7N J) CrI CR, rsI+J)CrI+sJ CR, andrslJ =(rI)(sJ) CR. & 


The product of two nonzero fractional ideals is a nonzero fractional ideal, and the 
multiplication is associative (since multiplication in R is associative). There is an identity 
element, namely R, since RI C I = 17 C RI. We will show that if R is a Dedekind 
domain, then every nonzero fractional ideal has a multiplicative inverse, so the nonzero 
fractional ideals form a group. 


7.6.7 Definitions and Comments 

A Dedekind domain is an integral domain R such that 
(1) R is Noetherian, 

(2) R is integrally closed, and 

(3) Every nonzero prime ideal of R is maximal. 


Every PID is a Dedekind domain, by (7.5.5), (7.1.7), (2.6.8) and (2.6.9). We will prove 
that the algebraic integers of a number field form a Dedekind domain. But as we know, 
the ring of algebraic integers need not be a PID, or even a UFD (see the discussion at the 
beginning of this chapter, and the exercises in Section 7.7). 


7.6.8 Lemma 


Let I be a nonzero prime ideal of the Dedekind domain R, and let J = {x € K: aI C R}. 
Then RC J. 
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Proof. Since RI C R, it follows that R is a subset of J. Pick a nonzero element a € J, 
so that J contains the principal ideal Ra. Let n be the smallest positive integer such 
that Ra contains a product P,---P, of n nonzero prime ideals. Since R is Noetherian, 
there is such an n by (7.6.3), and by (7.6.2), J contains one of the P;, say P;. But ina 
Dedekind domain, every nonzero prime ideal is maximal, so J = P,. Assuming n > 2, set 
I, = P,--- Py, so that Ra D Ty by minimality of n. Choose b € , with b ¢ Ra. Now 
II, © Ra, in particular, Ib C Ra, hence Iba~! C R. (Note that a has an inverse in K, 
but not necessarily in R.) Thus ba~! € J, but ba~! ¢ R, for if so, b € Ra, contradicting 
the choice of b. 

The case n = 1 must be handled separately. In this case, P} = I D Ra D P,, so 
I = Ra. Thus Ra is a proper ideal, and we can choose b € R with b ¢ Ra. Then 
ba~! ¢ R, but ba-'IT = ba !Ra=bRCR,so bates. & 


We now prove that in (7.6.8), J is the inverse of I. 


7.6.9 Proposition 


Let I be a nonzero prime ideal of the Dedekind domain R, and let J = {x € K: aI C R}. 
Then J is a fractional ideal and IJ = R. 


Proof. By definition, J is an R-submodule of K. If r is a nonzero element of J and x € J, 
then rz € R, so rJ C R and J is a fractional ideal. Now IJ C R by definition of J, so 
IJ is an integral ideal. Since (using (7.6.8)) J = IR C IJ C R, maximality of I implies 
that either JJ = I or IJ = R. In the latter case, we are finished, so assume IJ = I. 

If « € J, then «J C IJ = I, and by induction, x"J C I for alln = 1,2,.... Let r be 
any nonzero element of I. Then ra” € «"I CIC R, so R{] is a fractional ideal. Since 
R is Noetherian, part (ii) of (7.6.6) implies that R[a] is a finitely generated R-submodule 
of K. By (7.1.2), x is integral over R. But R, a Dedekind domain, is integrally closed, so 
x € R. Therefore J C R, contradicting (7.6.8). & 


Problems For Section 7.6 


1. Show that a proper ideal P is prime if and only if for all ideals A and B, P D> AB 
implies that P D A or PDB. 


We are going to show that if an ideal I is contained in the union of the prime ideals 
P,,..., Pn, then I is contained in some P;. Equivalently, if for all 2 = 1,...,n, we have 
I ¢ P,, then I Z UP, P;. There is no problem when n = 1, so assume the result holds 
for n — 1 prime ideals. By the induction hypothesis, for each 7 there exists x; € I with 
2. Show that we can assume without loss of generality that x; € P; for all 7. 

3. Continuing Problem 2, let « = SY, @1-+-@j-1%i41-°++2n. Show that x € I but 

x ¢ Ut_, P;, completing the proof. 

4. If I and J are relatively prime ideals (I+ J = R), show that IJ = IN J. More generally, 
if I1,...,Z, are relatively prime in pairs (see (2.3.7)), show that Iy--- I, =k. 
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5. Show that if a Dedekind domain R is a UFD, then R is a PID. 


6. Suppose that in (7.6.9), we would like to invert every maximal ideal of R, rather than 
the nonzero prime ideals. What is a reasonable hypothesis to add about R? 


7. Let R be an integral domain with quotient field K. If K is a fractional ideal of R, 
show that R= k. 


8. Let P; and Py» be relatively prime ideals in the ring R. Show that Pf and P; are 
relatively prime for arbitrary positive integers r and s. 


7.7 Unique Factorization of Ideals in a Dedekind 
Domain 


In the previous section, we inverted nonzero prime ideals in a Dedekind domain. We must 
now extend this result to nonzero fractional ideals. 


7.7.1 Theorem 


If J is anonzero fractional ideal of the Dedekind domain R, then J can be factored uniquely 
as Pj"! P'? ..-P? where the P; are prime ideals and the n; are integers. Consequently, 
the nonzero fractional ideals form a group under multiplication. 


Proof. First consider the existence of such a factorization. Without loss of generality, we 
can restrict to integral ideals. [Note that if r #0 and rl C R, then I = (Rr)~1(rI).] By 
convention, we regard R as the product of the empty collection of prime ideals, so let S 
be the set of all nonzero proper ideals of R that cannot be factored in the given form, 
with all n; positive integers. [This trick will yield the useful result that the factorization 
of integral ideals only involves positive exponents.] Since R is Noetherian, S, if nonempty, 
has a maximal element Jp, which is contained in a maximal ideal I. By (7.6.9), J has an 
inverse fractional ideal J. Thus by (7.6.8) and (7.6.9), 


Therefore JoJ is an integral ideal, and we claim that Ig C IpJ. For if Ip = IpJ, the 
last paragraph of the proof of (7.6.9) can be reproduced with I replaced by Jp to reach a 
contradiction. By maximality of Ip, [oJ is a product of prime ideals, say Ip J = P, ---: P, 
(with repetition allowed). Multiply both sides by the prime ideal J to conclude that Jp is 
a product of prime ideals, contradicting Ip € S. Thus S must be empty, and the existence 
of the desired factorization is established. 

To prove uniqueness, suppose that we have two prime factorizations 


My a ty t 
Pe pi = OOO 


where again we may assume without loss of generality that all exponents are positive. 
[If P~" appears, multiply both sides by P”.| Now P; contains the product of the P,”, 
so by (7.6.2), P, contains Q; for some j7. By maximality of Q;, P; = Q;, and we may 
renumber so that P,; = Qi. Multiply by the inverse of P, (a fractional ideal, but there is 
no problem) to cancel P, and Q,, and continue inductively to complete the proof. d& 
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7.7.2 Corollary 


A nonzero fractional ideal J is an integral ideal if and only if all exponents in the prime 
factorization of I are nonnegative. 


Proof. The “only if” part was noted in the proof of (7.7.1). The “if” part follows because 
a power of an integral ideal is still an integral ideal. d& 


7.7.3 Corollary 


Denote by np(I) the exponent of the prime ideal P in the factorization of I. (If P does 
not appear, take np(I) = 0.) If I; and Jy are nonzero fractional ideals, then I; D Ip if 
and only if for every prime ideal P of R, np(1) < np(I2). 


Proof. We have Iz C I, iff IgI>' C R, and by (7.7.2), this happens iff for every P, 
np(I2)—np(h) >0. de 


7.7.4 Definition 


Let J, and Ip be nonzero integral ideals. We say that I, divides Ig if Ig = JI, for some 
integral ideal J. Just as with integers, an equivalent statement is that each prime factor 
of J; is a factor of Ig. 


7.7.5 Corollary 


If J, and Iz are nonzero integral ideals, then J, divides Jz if and only if Jy D Ig. In other 
words, for these ideals, 


DIVIDES MEANS CONTAINS. 


Proof. By (7.7.4), I) divides Ig iff np(1) < np(I2) for every prime ideal P. By (7.7.3), 
this is equivalent to; DIg. & 


The next result explains why Dedekind domains are important in algebraic number 
theory. 


7.7.6 Theorem 


In the basic AK LB setup of (7.3.9), if A is a Dedekind domain, then so is B. In particular, 
the ring of algebraic integers in a number field is a Dedekind domain. In addition, B is a 
finitely generated A-module and the quotient field of B is L. 


Proof. By (7.1.6), B is integrally closed in L. The proof of (7.4.7), with x; replaced by an 
arbitrary element of L, shows that LD is the quotient field of B. Therefore B is integrally 
closed. By (7.5.13), B is a Noetherian ring, and the proof of (7.5.13) shows that B is a 
Noetherian, hence finitely generated, A-module. 
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It remains to prove that every nonzero prime ideal Q of B is maximal. Choose any 
nonzero element x of Q. Since x € B, x satisfies a polynomial equation 


ew t+an_1e” tf +--+ ax +a) =0 


with the a; € A. If we take the positive integer n as small as possible, then a9 4 0 by 
minimality of n. Solving for ag, we see that ag € Br NA CQNA, so P=QNAF 0D. But 
P is the preimage of the prime ideal Q under the inclusion map of A into B. Therefore P 
is a nonzero prime, hence maximal, ideal of the Dedekind domain A. Consequently, A/P 
is a field. 

Now A/P can be identified with a subring of the integral domain B/Q via y+ P > 
y+Q. Moreover, B/Q is integral over A/P. [B is integral over A, and we can simply use 
the same equation of integral dependence.] It follows from Section 7.1, Problem 5, that 
B/Q is a field, so Q is a maximal ideal. & 


Problems For Section 7.7 


By (7.2.3), the ring B of algebraic integers in Q(./—5) is Z[V/—5]. We will show that 
Z|V—5] is not a unique factorization domain. (For a different approach, see Section 2.7, 
Problems 5-7.) Consider the factorization 


(1+ V—5)(1 — V-5) = (2)(8). 


1. By computing norms, verify that all four of the above factors are irreducible. 
2. Show that the only units of B are +1. 


3. Show that no factor on one side of the above equation is an associate of a factor on 
the other side, so unique factorization fails. 


4. We can use the prime factorization of ideals in a Dedekind domain to compute the 
greatest common divisor and the least common multiple of two nonzero ideals J and J, 
exactly as with integers. Show that the greatest common divisor of J and J is I+ J 
and the least common multiple is IN J. 

5. A Dedekind domain R comes close to being a PID in the following sense. (All ideals 
are assumed nonzero.) If I is an integral ideal, in fact if J is a fractional ideal, show 
that there is an integral ideal J such that IJ is a principal ideal of R. 

6. Show that the ring of algebraic integers in Q(,/—17) is not a unique factorization 
domain. 


7. In Problem 6, the only algebraic integers of norm 1 are +1. Show that this property 
does not hold for the algebraic integers in Q(./—3). 


7.8 Some Arithmetic in Dedekind Domains 


Unique factorization of ideals in a Dedekind domain permits calculations that are analo- 
gous to familiar manipulations involving ordinary integers. In this section, we illustrate 
some of the ideas. 
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Let P,,...,P, be distinct nonzero prime ideals of the Dedekind domain R, and let 
J=P,---P,. Let Q; be the product of the P; with P; omitted, that is, 


Qi = Pr s+ Pra Pra +++ Pr 


(If n = 1, we take Q; = R.) If I is any nonzero ideal of R, then by unique factorization, 
IQ; D IJ. For each 1 = 1,...,n, choose an element a; belonging to 1Q; but not to IJ, 
and let a = )7j._, ai. 


7.8.1 Lemma 
a € I but for each i, a ¢ IP;. (In particular, a £ 0.) 


Proof. Since each a; belongs to IQ; C I, we have a € I. Now a; cannot belong to IP;,, for 
if so, ag € IP; NIQ;, which is the least common multiple of [P; and IQ; (see Section 7.7, 
Problem 4). But by definition of Q;, the least common multiple is simply IJ, and this 


contradicts the choice of a;. We break up the sum defining a as follows: 
a= (a, +--+ + 4:1) + ai + (Gi41 + +++ +n). (1) 


If 7 £2, then a; € IQ; C IP,, so the first and third terms of (1) belong to IP;. Since 
a; ¢ IP;, as found above, we havea ¢ IP;. & 


In Section 7.7, Problem 5, we found that any nonzero ideal is a factor of a principal 
ideal. We can sharpen this result as follows. 


7.8.2 Proposition 


Let I be a nonzero ideal of the Dedekind domain R. Then there is a nonzero ideal I’ such 
that IJ’ is a principal ideal (a). Moreover, if J is an arbitrary nonzero ideal of R, I’ can 
be chosen to be relatively prime to J. 


Proof. Let P,,..., Py be the distinct prime divisors of J, and choose a as in (7.8.1). Then 
a € I, so (a) C I. Since divides means contains (see (7.7.5)), I divides (a), so (a) = II’ 
for some nonzero ideal I’. If I’ is divisible by P;, then I’ = P;Ip for some nonzero ideal Jo, 
and (a) = IP;Ip. Consequently, a € IP;, contradicting (7.8.1). & 


7.8.3 Corollary 
A Dedekind domain with only finitely many prime ideals is a PID. 


Proof. Let J be the product of all the nonzero prime ideals. If J is any nonzero ideal, 
then by (7.8.2) there is a nonzero ideal I’ such that IJ’ is a principal ideal (a), with I’ 
relatively prime to J. But then the set of prime factors of I’ is empty, so that I’ = R. 
Thus (a) =II’=IR=I. & 


The next result shows that a Dedekind domain is not too far away from a principal 
ideal domain. 


7.9. P-ADIC NUMBERS 29 


7.8.4 Corollary 


Let I be a nonzero ideal of the Dedekind domain R, and let a be any nonzero element 
of I. Then J can be generated by two elements, one of which is a. 


Proof. Since a € I, we have (a) C I, so I divides (a), say (a) = IJ. By (7.8.2) there is 
a nonzero ideal J’ such that IJ’ is a principal ideal (b) and I’ is relatively prime to J. If 
gcd stands for greatest common divisor, then the ideal generated by a and b is 


gcd((a), (b)) = ged(IJ, II’) = I 


since gcd(J,I') =(1). & 


Problems For Section 7.8 


1. Let I(R) be the group of nonzero fractional ideals of a Dedekind domain R. If P(R) is 
the subset of J(R) consisting of all nonzero principal fractional ideals Rx, x € K, show 
that P(R) is a subgroup of [(R). The quotient group C(R) = I(R)/P(R) is called the 
ideal class group of R. Since R is commutative, C(R) is abelian, and it can be shown 
that in the number field case, C(R) is finite. 


2. Continuing Problem 1, show that C(R) is trivial iff R is a PID. 


We will now go through the factorization of an ideal in a number field. The neces- 
sary background is developed in a course in algebraic number theory, but some of the 
manipulations are accessible to us now. By (7.2.3), the ring B of algebraic integers of the 
number field Q(.\/—5) is Z[\/—5]. (Note that —5 = 3 mod 4.) If we wish to factor the 
ideal (2) = 2B in B, the idea is to factor x? +5 mod 2, and the result is 7 +5 = (x+1)? 
mod 2. Identifying x with /—5, we form the ideal Py = (2,1 ++./—5), which turns out to 
be prime. The desired factorization is (2) = P?. This technique works if B = Z[a], where 
the number field L is Q(a). 


. Show that 1 — /—5 € Py, and conclude that 6 € P?. 
. Show that 2 € P?, hence (2) C P?. 
. Expand P? = (2,1 + /—5)(2,1+ V—5), and conclude that P? C (2). 


. Following the technique suggested in the above problems, factor x? + 5 mod 3, and 
conjecture that the prime factorization of (3) in the ring of integers of Q(/—5) is 
(3) = P3P§ for appropriate P3; and P%. 


7. With P; and P3 as found in Problem 6, verify that (3) = P3P%. 


aD oOo FF W 


7.9 p-adic Numbers 


We will give a very informal introduction to this basic area of number theory. ( For further 
details, see Gouvea, “p-adic Numbers”.) Throughout the discussion, p is a fixed prime. 
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7.9.1 Definitions and Comments 
A p-adic integer can be described in several ways. One representation is via a series 
£=ao+aipt+aop? +---, a, EZ. (1) 


(Let’s ignore the problem of convergence for now.) The partial sums are x, = a9 + aip+ 
+++ +4np", so that tp — %n_1 = Gnp". A p-adic integer can also be defined as a sequence 
of integers « = {20,21,..., } satisfying 


Ln =XLn—-1 Mod p”, n=1,2,.... (2) 
Given a sequence satisfying (2), we can recover the coefficients of the series (1) by 


1 — Xo t2— % 
ag = %, Ay = —, AQ >= 2 grees 


Pp Pp 


The sequences x and y are regarded as defining the same p-adic integer iff 7, = yn mod 
p’+! mn =0,1,.... Replacing each x, by the smallest nonnegative integer congruent to 
it mod p”*? is equivalent to restricting the a; in (1) to {0,1,...,p—1}. [We call this the 
standard representation.| Thus (1) is the limiting case in some sense of an expansion in 
base p. 

Sums and products of p-adic integers can be defined by polynomial multiplication if (1) 
is used. With the representation (2), we take 


r+y={Irt+ Yn}, ry ={lnYn}- 


With addition and multiplication defined in this way, we get the ring of p-adic integers, 
denoted by @,. (A more common notation is Z,, with the ring of integers modulo p 
written as Z/pZ. Since the integers mod p occur much more frequently in this text than 
the p-adic integers, and Z, is a bit simpler than Z/pZ, I elected to use Z, for the integers 
mod p.) The rational integers Z form a subring of 6, via x = {x,x,z,... }. 

We now identify the units of 4). 


7.9.2 Proposition 


The p-adic integer x = {x,,} is a unit of @, (also called a p-adic unit) if and only if zo £0 
mod p. In particular, a rational integer a is a p-adic unit if and only if a 40 mod p. 


Proof. If (ag + ayp + -+-)(bo + bip +--+) = 1, then agbp = 1, so ag # 0 mod p, proving 
the “only if” part. Thus assume that x 4 0 mod p. By (2), tm, = Um—-1 = +--+: = Xo 
mod p, so z, # 0 mod p. Therefore x, and p"t! are relatively prime, so there exists 
Yn such that %pyn = 1 mod p™*, hence mod p”. Now by (2), an = %p—1 mod p”, so 
InYn—-1 = Ln-1Yn—-1 = 1 mod p”. Thus enyn = LnYn—1 mod p”, so Yn = Yn—1 mod p”. 
The sequence y = {y,,} is therefore a p-adic integer, and by construction, zy=1. & 
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7.9.3 Corollary 


Every nonzero p-adic integer has the form « = p”u where n > 0 and u is a p-adic unit. 
Consequently, #, is an integral domain. Furthermore, @, has only one prime element p, 
and every x € @, is a power of p, up to multiplication by a unit. 


Proof. The series representation for x has a nonzero term a,p” of lowest degree n, where 
dy, can be taken between 1 and p—1. Factor out p” to obtain « = p”u, where u is a unit 
by (7.9.2). & 


7.9.4 Definitions and Comments 


The quotient field Q, of @, is called the field of p-adic numbers. By (7.9.3), each a € Q, 
has the form p™u, where m is an integer (possibly negative) and wu is a unit in 6,. Thus 
a has a “Laurent expansion” 


ed | | 
+.--4 S +ag9tayprrr:. 


Another representation is a = x/p", where x is a p-adic integer and r > 0. This version 
is convenient for doing addition and multiplication in Q,. 

The rationals Q are a subfield of Q,. To see this, let a/b be a rational number in 
lowest terms (a and 6 relatively prime). If p does not divide b, then by (7.9.2), b is a unit 
of @,. Since a € Z C 6,, we have a/b € 6). If b = p‘b’ where p does not divide b’, we 
can factor out p’ and reduce to the previous case. Thus a/b always belongs to Q,, and 
a/b € 6, iff p does not divide b. Rational numbers belonging to 4, are sometimes called 
p-integers. 

We now outline a procedure for constructing the p-adic numbers formally. 


7.9.5 Definitions and Comments 
The p-adic valuation on Q, is defined by 
Up(p™u) =m. 
In general, a valuation v on a field F is a real-valued function on F' \ {0} satisfying: 


(a) v(xy) = v(x) + v(y); 
(b) o(a + y) 2 min(v(z), o(y)). 


By convention, we take v(0) = + oo. 
The representation x = p’™u shows that v, is indeed a valuation on Q,. If c is any 
real number greater than 1, then the valuation v induces an absolute value on F’, namely, 


|x| = c~?), 


When v = vp, the constant c is usually taken to be p, and we obtain the p-adic absolute 
value 


lZlp a 


32 CHAPTER 7. INTRODUCING ALGEBRAIC NUMBER THEORY 


n 


Thus the p-adic absolute value of p” is p~”, which approaches 0 exponentially as n ap- 


proaches infinity. 
In general, an absolute value on a field F is a real-valued function on F such that: 


(i) |2| > 0, with equality if and only if « = 0; 
(ii) |ey| = |2llyl; 
(ili) [2 + yl < |2| + [yl 


By (b), an absolute value induced by a valuation satisfies a property that is stronger 
than (iii): 


(iv) a + y| < max(|2], |y)). 


An absolute value satisfying (iv) is said to be nonarchimedian. 


7.9.6 Proposition 


Let F' be the quotient field of an integral domain R. The absolute value | | on F is 
nonarchimedian if and only if |n| < 1 for every integer n = 1 1eER. 


Proof. Assume a nonarchimedian absolute value. By property (ii) of (7.9.5), |41| = 1. If 
|n| < 1, then by property (iv), |y+1| < 1, and the desired conclusion follows by induction. 
Conversely, assume that the absolute value of every integer is at most 1. To prove (iv), it 
suffices to show that |x +1] < max(|z|,1) for every x € F. [If y £0 in (iv), divide by |y].] 
By the binomial theorem, 


n . nr vi . n Tr 
err =1o (eis O1(") ter 
r=0 r=0 


By hypothesis, the integer (") has absolute value at most 1. If |z| > 1, then |z|" < |z|” 
for allr =0,1,...,n. If |a| <1, then |a|" < 1. Consequently, 


jo +1)" < (n+ 1) max(|z|", 1). 
Take n‘” roots and let n — oo to get |2 +1] < max(|z|,1). & 


The next result may seem innocuous, but it leads to a remarkable property of nonar- 
chimedian absolute values. 


7.9.7 Proposition 


If | | is a nonarchimedian absolute value, then 


|| A |y| implies |x + y| = max(|a|,|y|). 
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Proof. First note that | — y| = |(—1)y| = | — 1||y| = |y|. We can assume without loss of 
generality that || > |y|. Using property (iv) of (7.9.5), we have 


|x| = |e +y—y| < max(|x+ yl, |yl) =|2 + yl. 


(Otherwise, max(|xz + y], |y|) = |y|, hence |x| < |y| < |a|, a contradiction.] Since |x + y| < 
max(|z|,|y|) = |a|, the result follows. & 


Any absolute value determines a metric via d(z,y) = |x — y|. This distance function 
can be used to measure the length of the sides of a triangle. 


7.9.8 Corollary 


With respect to the metric induced by a nonarchimedian absolute value, all triangles are 
isosceles. 


Proof. Let the vertices of the triangle be x,y and z. Then 


Jz — yl = |(@— 2) + (2—-y)I 


If |a — z| = |z — y|, then two side lengths are equal. If |a — z| 4 ||z — y|, then by (7.9.7), 
|x — y| = max(|a — z|,|z — y|), and again two side lengths are equal. & 


We now look at the p-adic numbers from the viewpoint of valuation theory. 


7.9.9 Definitions and Comments 


Let | | be a nonarchimedian absolute value on the field F’. The valuation ring of | | is 
6={x Ee F: |x| < 1}. 


In the p-adic case, 9 = {x € Q,: vp(x) > 0} = 6,. By properties (ii) and (iv) of (7.9.5), 6 
is a subring of F’. 
The valuation ideal of | | is 


B={axe F: |x| < 1}. 


In the p-adic case, 8 = {x € Qp: up(x) > 1} = pb, those p-adic integers whose series 
representation has no constant term. To verify that (@ is an ideal of @, note that if x,y € B 
and r € @, then |rz| = |r||z| < |z| < 1 and ja + y| < max(|z|, |y|) <1. 

Now if « € 6 \ 6, then |a| = 1, hence |z~+| = 1/|z] = 1, so a7! € O and wisa 
unit of 8. On the other hand, if « € @, then x cannot be a unit of 6. [If ey = 1, then 
1 = |z||y| < || < 1, a contradiction.] Thus the ideal @ is the set of all nonunits of 6. No 
proper ideal I of 6 can contain a unit, so J C (. It follows that @ is the unique maximal 
ideal of 9. A ring with a unique maximal ideal is called a local ring. We will meet such 
rings again when we examine the localization process in Section 8.5. 

To construct the p-adic numbers, we start with the p-adic valuation on the integers, 
and extend it to the rationals in the natural way: v,(a/b) = vp(a) — vp(b). The p-adic 
valuation then determines the p-adic absolute value, which induces a metric d on Q. 
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[Because d comes from a nonarchimedian absolute value, it will satisfy the ultrametric 
inequality d(x,y) < max(d(x, z),d(z,y)), which is stronger than the triangle inequality.] 
The process of constructing the real numbers by completing the rationals using equivalence 
classes of Cauchy sequences is familiar. The same process can be carried out using the 
p-adic absolute value rather than the usual absolute value on Q. The result is a complete 
metric space, the field of p-adic numbers, in which Q is dense. 

Ostrowski’s theorem says that the usual absolute value | |, on Q, along with the p-adic 
absolute values | |, for all primes p, and the trivial absolute value (|0| = 0;|z| = 1 for 
x #0), essentially exhaust all possibilities. To be more precise, two absolute values on a 
field F' are equivalent if the corresponding metrics on F' induce the same topology. Any 
nontrivial absolute value on Q is equivalent to | |. or to one of the | |». 


Problems For Section 7.9 


1. Take p = 3, and compute the standard representation of (2 + p + p?)(2 + p”) in two 
ways, using (1) and (2) of (7.9.1). Check the result by computing the product using 
ordinary multiplication of two integers, and then expanding in base p = 3. 


2. Express the p-adic integer -1 as an infinite series of the form (1), using the standard 
representation. 


. Show that every absolute value on a finite field is trivial. 
. Show that an absolute value is archimedian iff the set S = {|n|: n € Z} is unbounded. 


. Show that a field that has an archimedian absolute value must have characteristic 0. 


Da om KR W 


. Show that an infinite series 5> z, of p-adic numbers converges if and only if z, — 0 as 
nm — OOo. 


7. Show that the sequence a, = n! of p-adic integers converges to 0. 


8. Does the sequence an = n converge in Q,? 


Chapter - 8 
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Chapter 8 


Introducing Algebraic 
Geometry 


(Commutative Algebra 2) 


We will develop enough geometry to allow an appreciation of the Hilbert Nullstellensatz, 
and look at some techniques of commutative algebra that have geometric significance. As 
in Chapter 7, unless otherwise specified, all rings will be assumed commutative. 


8.1 Varieties 


8.1.1 Definitions and Comments 


We will be working in k[X1,..., X,], the ring of polynomials in n variables over the field 
k. (Any application of the Nullstellensatz requires that k be algebraically closed, but we 
will not make this assumption until it becomes necessary.) The set A” = A”(k) of all 
n-tuples with components in k is called affine n-space. If S is a set of polynomials in 
k[X1,...,Xn], then the zero-set of S, that is, the set V = V(S) of all a € A” such that 
f(x) = 0 for every f € S, is called a variety. (The term “affine variety” is more precise, 
but we will use the short form because we will not be discussing projective varieties.) 
Thus a variety is the solution set of simultaneous polynomial equations. 

If I is the ideal generated by S, then J consists of all finite linear combinations SO g; fi 
with g; € k[X1,..., Xn] and f; € S. It follows that V(S) = V(J), so every variety is the 
variety of some ideal. We now prove that we can make A” into a topological space by 
taking varieties as the closed sets. 


8.1.2 Proposition 


(1) If Va =V (iq) for all a € T, then ()V. = V(U I.). Thus an arbitrary intersection of 
varieties is a variety. 
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(2) If V; =V(U;), j=1,...,7, then Uja1 Vi =V({fi--- fp: fp €,1< 9 <r}). Thusa 
finite union of varieties is a variety. 


(3) A” = V(0) and # = V(1), so the entire space and the empty set are varieties. 


Consequently, there is a topology on A”, called the Zariski topology, such that the 
closed sets and the varieties coincide. 


Proof. (1) If a € A”, then x € ()V, iff every polynomial in every J, vanishes at «x iff 
xe€V(UIo). 

(2) rE ies V; iff for some j, every f; € I; vanishes at ¢ iffa eV({fi--- fr: fy EL 
for all j}). 

(3) The zero polynomial vanishes everywhere and a nonzero constant polynomial van- 
ishes nowhere. @& 


Note that condition (2) can also be expressed as 
jai =V | [TG] =V 5a). 
j=l 


[See (7.6.1) for the definition of a product of ideals.] 
We have seen that every subset of k[1X1,...,X,], in particular every ideal, determines 
a variety. We can reverse this process as follows. 


8.1.3. Definitions and Comments 


If X is an arbitrary subset of A”, we define the ideal of X as I(X) = {f € k[Xy,..., Xp]: 
f vanishes on X}. By definition we have: 


(4) If X CY then I(X) D I(Y); if S C T then V(S) D2 V(T). 


Now if S is any set of polynomials, define [V(S) as I(V(S)), the ideal of the zero-set 
of S; we are simply omitting parentheses for convenience. Similarly, if X is any subset 
of A”, we can define VI(X), IVI(X), VIV(S), and so on. From the definitions we 
have: 


(5) IV(S) DS; VI(X) DX. 


(If f € S then f vanishes on V(S), hence f € IV(S). If x € X then every polynomial in 
I(X) vanishes at x, so x belongs to the zero-set of I(X).] 
If we keep applying V’s and I’s alternately, the sequence stabilizes very quickly: 


(6) VIV(S) = V(S); IVI(X) = I(X). 


[In each case, apply (4) and (5) to show that the left side is a subset of the right side. If 
x € V(S) and f € IV(S) then f(x) = 0, so x € VIV(S). If f € I(X) and x € VI(X) 
then x belongs to the zero-set of I[(X), so f(x) = 0. Thus f vanishes on VI(X), so 
f eIVI(Xx),] 

Since every polynomial vanishes on the empty set (vacuously), we have: 
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(7) 1(O0) = kLXy,..., Xn]. 
The next two properties require a bit more effort. 


(8) If & is an infinite field, then [(A”) = {0}; 
(9) If w = (a1,...,an) € A”, then I({x}) = (X1 — ay,..., Xn — an). 


Property (8) holds for n = 1 since a nonconstant polynomial in one variable has only 
finitely many zeros. Thus f 4 0 implies that f ¢ [(A'). Ifn > 1, let f =a,X7+--°+ 
a,X + ao where the a; are polynomials in X2,...,X, and a, # 0. By the induction 
hypothesis, there is a point (v2,...,2%n) at which a, does not vanish. Fixing this point, 
we can regard f as a polynomial in X,, which cannot possibly vanish at all 7; € k. Thus 
f € I(A”). 

To prove (9), note that the right side is contained in the left side because X; — a; is 0 
when X; = a;. Also, the result holds for n = 1 by the remainder theorem (2.5.2). Thus 
assume n > 1 and let f = b,X7 +---+6,X1+ bo € I({x}), where the 6; are polynomials 
in Xo,...,X, and b, £0. By the division algorithm (2.5.1), we have 


f = (X1 ae a1)g(X1,. oe Xn) + h(Xa,... Xn) 


and h must vanish at (a2,...,@,). By the induction hypothesis, h € (X2—a2,...,Xn—Gn), 
hence f (X1 ay, X2 a2,...,Xn — An). 


Problems For Section 8.1 


A variety is said to be reducible if it can be expressed as the union of two proper subva- 
rieties; otherwise the variety is irreducible. In Problems 1-4, we are going to show that a 
variety V is irreducible if and only if [(V) is a prime ideal. 


1. Assume that I(V) is not prime, and let f, fo € I(V) with fi, fo ¢ 1(V). Ifa € V, 
show that « ¢ V(f,) implies x € V(fo2) (and similarly, « ¢ V(f2) implies « € V(f1)). 


2. Show that V is reducible. 
3. Show that if V and W are varieties with V C W, then I(V) D I(W). 


4. Now assume that V = Vi UV2, with Vi,V2 C V. By Problem 3, we can choose 
fi € T(Vi) with f; ¢ I(V). Show that fifo € I(V), so I(V) is not a prime ideal. 

5. Show that any variety is the union of finitely many irreducible subvarieties. 

6. Show that the decomposition of Problem 5 is unique, assuming that we discard any 
subvariety that is contained in another one. 


7. Assume that k is algebraically closed. Suppose that A” is covered by open sets A” \ 
V(J;) in the Zariski topology. Let J is the ideal generated by the J;, so that I = So hi, 
the set of all finite sums 2;, +--+ a;, with 7;, € Ij, Show that 1 € J. (You may appeal 
to the weak Nullstellensatz, to be proved in Section 8.4.) 


8. Show that A” is compact in the Zariski topology. 


4 CHAPTER 8. INTRODUCING ALGEBRAIC GEOMETRY 


8.2 The Hilbert Basis Theorem 


If S is a set of polynomials in k[Xj,...,X»], we have defined the variety V(S') as the 
zero-set of S, and we know that V(.S) = V(J), where J is the ideal generated by S. Thus 
any set of simultaneous polynomial equations defines a variety. In general, infinitely many 
equations may be involved, but as Hilbert proved, an infinite collection of equations can 
always be replaced by a finite collection. The reason is that every ideal of k[X1,..., Xn] 
has a finite set of generators, in other words, k[X,,..., Xn] is a Noetherian ring. The 
field & is, in particular, a PID, so k is Noetherian. The key step is to show that if R isa 
Noetherian ring, then the polynomial ring in n variables over R is also Noetherian. 


8.2.1 Hilbert Basis Theorem 
If R is a Noetherian ring, then R[X,,...,Xy,] is also Noetherian. 


Proof. By induction, we can assume n = 1. Let I be an ideal of R[X], and let J be the ideal 
of all leading coefficients of polynomials in I. (The leading coefficient of 5X? — 3X + 17 
is 5; the leading coefficient of the zero polynomial is 0.) By hypothesis, J is finitely 
generated, say by a1,...,@,. Let f; be a polynomial in J whose leading coefficient is aj, 
and let d; be the degree of f;. Let J* consist of all polynomials in I of degree at most 
d = max{d;: 1 <i <n}. Then J* is an R-submodule of the free R-module M of all 
polynomials by + b)X +--+ +bgX%,b; € R. Now a finitely generated free R-module is a 
finite direct sum of copies of R, hence M, and therefore J*, is Noetherian. Thus J* can 
be generated by finitely many polynomials g1,...,gm. Take Ip to be the ideal of R[X] 
generated by fi,.-.,fns91;---;9m- We will show that Jy = J, proving that J is finitely 
generated. 

First observe that f; € J and g; € I* C I, so Ig C J. Thus we must show that each 
h € I belongs to Ip. 

Case 1: degh <d 

Then h € I*, so h is a linear combination of the g; (with coefficients in R C R[X]), so 
he Io. 

Case 2: degh=r>d 

Let a be the leading coefficient of h. Since a € J, we have a = >i, cia; with the 
c;, © R. Let 


g= h— So og Xf, el. 
t=1 


The coefficient of X” in q is 


n 
a-— y ca; =0 
i=l 


so that deg q < r. We can iterate this degree-reduction process until the resulting poly- 
nomial has degree d or less, and therefore belongs to Jp. But then h is a finite linear 
combination of the f; and g;. 
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8.2.2 Corollary 


Every variety is the intersection of finitely many hypersurfaces (zero-sets of single poly- 
nomials). 


Proof. Let V = V(I) be a variety. By (8.2.1), I has finitely many generators fi,..., fr. 
But then V=();_, V(fi). & 


8.2.3. Formal Power Series 


The argument used to prove the Hilbert basis theorem can be adapted to show that if R is 
Noetherian, then the ring R[[X]] of formal power series is Noetherian. We cannot simply 
reproduce the proof because an infinite series has no term of highest degree, but we can 
look at the lowest degree term. If f = a,X" +a,,,X"t!+-+--, where r is a nonnegative 
integer and a, # 0, let us say that f has degree r and leading coefficient a,. (If f = 0, 
take the degree to be infinite and the leading coefficient to be 0.) 

If J is an ideal of R[[X]], we must show that I is finitely generated. We will inductively 
construct a sequence of elements f; € R[[X]] as follows. Let f; have minimal degree among 
elements of J. Suppose that we have chosen f;,..., f;, where f; has degree d; and leading 
coefficient a;. We then select fi+1 satisfying the following three requirements: 


1. fii belongs to I; 
2. ai41 does not belong to (a1,...,a;), the ideal of R generated by the a;, 7 =1,...,%; 
3. Among all elements satisfying the first two conditions, f;;1 has minimal degree. 

The second condition forces the procedure to terminate in a finite number of steps; 
otherwise there would be an infinite ascending chain (a1) C (a1, a2) C (a1, a2,a3) C-+-. 
If stabilization occurs at step k, we will show that I is generated by fi,..., fr. 

Let g = aX¢4+--- be an element of I of degree d and leading coefficient a. Then 
a € (a1,...,@%) (Problem 1). 

Case 1: d > dx. Since d; < d;+, for all i (Problem 2), we have d > d; fori =1,...,k. 
Now a= a c;0a; with the cio € R. Define 


k 
go = > cio XT fy 
i=l 


so that go has degree d and leading coefficient a, and consequently g — go has degree 
greater than d. Having defined go,...,9r € (fi,---, f~) such that g — }7\_» gi has degree 
greater than d+ 1, say 


r 


9-69 =OXOT1 SE ., 
i=0 


(The argument is the same if the degree is greater than d+7r+1.) Now b€ (ai,..., ax) 
(Problem 1 again), so 


k 
b= s Cir 41% 
i=l 
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with ¢j,,41 € R. We define 


k 


d+r+1—d; 
Gr+1 = s Cepia” fi 
i=l 


so that g — pay gi has degree greater than d+r-+1. Thus 


oo oo 6k 
9= a-axis 
r=0 r=0 i=1 
and it follows upon reversing the order of summation that g € (fi,..., fx). (The reversal 


is legal because the inner summation is finite. For a given nonnegative integer 7, there 
are only finitely many terms of the form bX/.) 

Case 2: d < dz. As above, a € (a1,...,@%), 80 there is a smallest m between 1 and 
k such that a € (a1,...,@m). It follows that d > d,, (Problem 3). As in case 1 we have 
a=)", qa; with c; € R. Define 


h= Saxe fi € (fiy--- fx) CT. 


i=l 


The leading coefficient of h is a, so the degree of g — h is greater than d. We replace g by 
g — hand repeat the procedure. After at most d, — d iterations, we produce an element 
g —>=h,; in I of degree at least d,, with all h; € (fi,..., f,). By the analysis of case 1, 


GEC ta FR): 


Problems For Section 8.2 


1. Justify the step a € (a1,...,a,) in (8.2.3). 

2. Justify the step d; < dj, in (8.2.3). 

3. Justify the step d > dy», in (8.2.3). 

4. Let R be a subring of the ring S, and assume that S$ is finitely generated as an algebra 
over R. In other words, there are finitely many elements 71,...,%,, € S such that the 
smallest subring of S containing the x; and all elements of R is S itself. Show that S 
is a homomorhic image of the polynomial ring R[X1,..., Xn]. 


5. Continuing Problem 4, show that if R is Noetherian, then S' is also Noetherian. 


8.3 The Nullstellensatz: Preliminaries 


We have observed that every variety V defines an ideal [(V) and every ideal I defines a 
variety V(I). Moreover, if [(Vi) = I(V2), then V; = V2 by (6) of (8.1.3). But it is entirely 
possible for many ideals to define the same variety. For example, the ideals (f) and (f”) 
need not coincide, but their zero-sets are identical. Appearances to the contrary, the two 
statements in part (6) of (8.1.3) are not symmetrical. A variety V is, by definition, always 
expressible as V(S) for some collection S of polynomials, but an ideal J need not be of the 
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special form I(X). Hilbert’s Nullstellensatz says that if two ideals define the same variety, 
then, informally, the ideals are the same “up to powers”. More precisely, if g belongs to 
one of the ideals, then g” belongs to the other ideal for some positive integer r. Thus the 
only factor preventing a one-to-one correspondence between ideals and varieties is that 
a polynomial can be raised to a power without affecting its zero-set. In this section we 
collect some results needed for the proof of the Nullstellensatz. We begin by showing that 
each point of A” determines a maximal ideal. 


8.3.1 Lemma 


If a = (a1,...,@n) € A”, then J = (X1 — aj,..., Xn — Gn) is a maximal ideal. 


Proof. Suppose that I is properly contained in the ideal J, with f € J\J. Apply the 
division algorithm n times to get 


f = A(X) — a1) + Ag(X2 —ag)+ An (Xn —Gn) +b 


where Ay € k[Xy,...,Xn], Ag € k[Xo,...,Xn], -.., An € k[Xn], 6 © k. Note that b 
cannot be 0 since f ¢ I. But f € J, so by solving the above equation for b we have b € J, 
hence 1 = (1/b)b € J. Consequently, J = k[X1,...,Xn]. & 


The following definition will allow a precise statement of the Nullstellensatz. 


8.3.2 Definition 


The radical of an ideal I (in any commutative ring R) is the set of all elements f € R 
such that f” € I for some positive integer r. 

A popular notation for the radical of I is VI. If f’ and g* belong to J, then by the 
binomial theorem, (f + g)"**~1 € J, and it follows that VT is an ideal. 


8.3.3 Lemma 
If I is any ideal of k[X1,..., Xp], then VI C IV(J). 


Proof. If f € VI, then f’ € I for some positive integer r. But then f” vanishes on V(J), 
hence so does f. Therefore fe IV(I). & 


The Nullstellensatz states that IV(I) = VI, and the hard part is to prove that 
IV(I) C VI. The technique is known as the “Rabinowitsch trick”, and it is indeed 
very clever. Assume that f € IV(I). We introduce a new variable Y and work in 
k[X1,...,Xn,Y]. If I is an ideal of k[X,,...,X,], then by the Hilbert basis theorem, 
I is finitely generated, say by f1,..., fm. Let I* be the ideal of k[X,,...,Xn,Y] gener- 
ated by fi,...,fm,1—Yf. [There is a slight ambiguity: by f;(41,...,Xn,Y) we mean 
fi(%1,...,Xn), and similarly for f.] At an appropriate moment we will essentially set Y 
equal to 1/f and come back to the original problem. 
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8.3.4 Lemma 
If (a1,.--,@n,@n41) is any point in A"*+ and (a1,...,a@n,) € V(J) (in other words, the f;, 
i=1,...,m, vanish at (a1,...,@n)), then (a1,...,@n,@n41) ¢ VU"). 


Proof. We are assuming that f € IV (J), so that f vanishes on the zero-set of {f1,..., fm}. 
In particular, f(a1,...,@,) = 0. The value of 1—Yf at (a1,...,@n,@n41) is therefore 
1—dn4if(ai,...,@n) = 1—an41(0) =1 40. But 1—Yf € I*, so (a1,...,@n, Gn41) does 
not belong to the zero-set of [*. & 


8.3.5 Lemma 


If (a1,..-,@n,@n41) is any point in A"*? and (a1,...,@n) ¢ V(J), then (a1,...,@n,@n41)¢ 
V(I*). Consequently, by (8.3.4), VU*) = 0. 


Proof. By hypothesis, f;(a1,..-,@n,@n+1) 4 0 for some i, and since f; € I*, (a1,..-,@n41) 
cannot belong to the zero-set of I*. d& 


At this point we are going to assume what is called the weak Nullstellensatz, namely 
that if I is a proper ideal of kX 1,...,X,], then V(J) is not empty. 


8.3.6 Lemma 
There are polynomials g1,...,gm,h € k[X1,..., Xn, Y] such that 


1= So gfi+h-Yf). (1) 


i=l 


This equation also holds in the rational function field k(X1,..., Xn, Y) consisting of quo- 
tients of polynomials in k[X,,...,Xn,Y]. 


Proof. By (8.3.4) and (8.3.5), V(UI*) = 0, so by the weak Nullstellensatz, I* = k[Xi,..., 
Xy,Y]. In particular, 1 € I*, and since J* is generated by fi,...,fm,1—Yf, there is an 
equation of the specified form. The equation holds in the rational function field because 
a polynomial is a rational function. & 


8.3.7 The Rabinowitsch Trick 
The idea is to set Y = 1/f, so that (1) becomes 


DES GG ei Xp lf (Mie, Ma) i ay es (2) 
w=1 


Is this legal? First of all, if f is the zero polynomial, then certainly f € VI, so we 
can assume f # 0. To justify replacing Y by 1/f, consider the ring homomorphism 
from k[X1,...,Xn,Y] to k(X1,...,X,) determined by X; > X;, i = 1,...,n, Y > 
1/f(X1,...,Xn). Applying this mapping to (1), we get (2). Now the right side of (2) isa 
sum of rational functions whose denominators are various powers of f. If f” is the highest 
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power that appears, we can absorb all denominators by multiplying (2) by f”. The result 
is an equation of the form 


m 


PS Se eee hg) Ry ah) 


i=l 


where the h; are polynomials in k[X1,...,X,]. Consequently, f7 eI. & 
The final ingredient is a major result in its own right. 


8.3.8 Noether Normalization Lemma 


Let A be a finitely generated k-algebra, where & is a field. In other words, there are 
finitely many elements 21,...,%, in A that generate A over k in the sense that every 
element of A is a polynomial in the x;. Equivalently, A is a homomorphic image of the 
polynomial ring k[X1,...,X,] via the map determined by X; > 2;,i=1,...,n. 

There exists a subset {y1,...,y,} of A such that the y; are algebraically independent 
over & and A is integral over k[yi,..., yr]. 


Proof. Let {21,...,2,-} be a maximal algebraically independent subset of {21,...,a@n}. 
If n = r we are finished, since we can take y; = x; for all 7. Thus assume n > r, in which 
case £1,...,2n are algebraically dependent over k. Thus there is a nonzero polynomial 
f © k[M,...,Xn] such that f(a,...,¢n) = 0. We can assume n > 1, for if mn = 1 and 
r =0, then A = k[x,] and we can take {y1,...,y,} to be the empty set. 

We first assume that & is infinite and give a proof by induction on n. (It is possible to 
go directly to the general case, but the argument is not as intricate for an infinite field.) 
Decompose f into its homogeneous components (sums of monomials of the same degree). 
Say that g is the homogeneous component of maximum degree d. Then, regarding g as a 
polynomial in X,, whose coefficients are polynomials in the other X;, we have, relabeling 
variables if necessary, g(X1,...,Xn—1,1) # 0. Since k is infinite, it follows from (8.1.3) 
part (8) that there are elements a1,...,@n—1 € k such that g(ai,...,@,-1,1) 4 0. Set 
2, = 1 — Ly, t=1,...,n—1, and plug into f(21,...,2,) = 0 to get an equation of the 
form 

g(a1,.--,@n—1, 1) + terms of degree less than d in x, = 0. 
A concrete example may clarify the idea. If f(a1,22) = g(v1,22) = vjx3 and x = 
Z, + a,%2, then the substitution yields 


2 2.2) ,.3 
(2{ + 2ay 2142 + ajx5) x5 


which indeed is g(a1,1)23 plus terms of degree less than 5 in x2. Divide by 
g(a@1,---,@n-1,1) # 0 to conclude that x, is integral over B = k[z,...,2n-1]. By 


the induction hypothesis, there are elements yi,...,y, algebraically independent over 
k such that B is integral over k[y,,...,y,]. But the x;,7 <n, are integral over B since 
XL; = % + a;%,. By transitivity (see (7.1.4)), 71,...,%, are integral over k[y),..., yr]. 


Thus (see (7.1.5)) A is integral over klyi,..., yr]. 
Now we consider arbitrary k. As before, we produce a nonzero polynomial f such 
that f(r1,...,2%n) = 0. We assign a weight w; = s”~* to the variable X;, where s is a 
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large positive integer. (It suffices to take s greater than the total degree of f, that is, 
the sum of the degrees of all monomials in f.) If h = AXy"---X@" is a monomial of f, 
we define the weight of h as w(h) = 37%, ajw;. The point is that if h’ = uX?.--Xbn, 
then w(h) > w(h’) iff h > h’ in the lexicographic ordering, that is, for some m we have 
a; = b; fori < m, and a@m41 > bm4i. We take h to be the monomial of maximum weight. 
(If two monomials differ in the lexicographic ordering, they must have different weights.) 
Set 2, =a; —2¥@,1<i<n-—1, and plug into f(21,...,2,) =0 to get 


cx’) + terms of lower degree in a, = 0. 
For example, if f(21,22) = h(a, v2) = v372, then 2] = 21 + r5" gives 
(22 + 3220 + S203" + 03") a2 


and w(h) = 3w, + 2w2 = 3w, + 2 since s"-? = s° = 1. Thus 2, is integral over 
B=kj[z,...,2n—1], and an induction argument finishes the proof as in the first case. & 


8.3.9 Corollary 


Let B be a finitely generated k-algebra, where k is a field. If I is a maximal ideal of B, 
then B/TI is a finite extension of k. 


Proof. The field & can be embedded in B/I viac > c+I,c ek. [If c € J, 
c #0, then c'c = 1 € J, a contradiction.] Since A = B/I is also a finitely gener- 
ated k-algebra, it follows from (8.3.8) that there is a subset {y1,...,y,} of A with the 
y; algebraically independent over & and A integral over k[yi,...,y,]. Now A is a field 
(because I is a maximal ideal), and therefore so is k[y1,..., yr] (see the Problems in Sec- 
tion 7.1). But this will lead to a contradiction if r > 1, because 1/y, ¢ klyi,..., yr]. 
(If 1/y1 = g(yi,---,¥r) © klyi,---, yr], then yig(y1,---, Yr) = 1, contradicting algebraic 
independence.) Thus r must be 0, so A is integral, hence algebraic, over the field k. 
Therefore A is generated over k by finitely many algebraic elements, so by (3.3.3), Aisa 
finite extension of k. dé 


8.3.10 Corollary 


Let A be a finitely generated k-algebra, where & is a field. If A is itself a field, then A is 
a finite extension of k. 


Proof. As in (8.3.9), with B/I replaced by A. & 


Problems For Section 8.3 


1. Let S be a multiplicative subset of the ring R (see (2.8.1)). If J is an ideal that is 
disjoint from S$, then by Zorn’s lemma, there is an ideal J that is maximal among 
ideals disjoint from S. Show that J must be prime. 
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2. Show that the radical of the ideal J is the intersection of all prime ideals containing J. 
(If f’ € I C P, P prime, then f € P. Conversely, assume f ¢ VI. With a clever 
choice of multiplicative set S, show that for some prime ideal P containing I, we have 


f¢P] 


3. An algebraic curve is a variety defined by a nonconstant polynomial in two variables. 
Show (using the Nullstellensatz) that the polynomials f and g define the same algebraic 
curve iff f divides some power of g and g divides some power of f. Equivalently, f and 
g have the same irreducible factors. 


4. Show that the variety V defined over the complex numbers by the two polynomials 
Y?— XZ and Z? — X7Y is the union of the line L given by Y = Z = 0, X arbitrary, 
and the set W of all (¢°,t*,t°), t € C. 


5. The twisted cubic is the variety V defined over the complex numbers by Y — X? and 
Z — X. In parametric form, V = {(t,t?,t?): t € C}. Show that V is irreducible. [The 
same argument works for any variety that can be parametrized over an infinite field.] 


6. Find parametrizations of the following algebraic curves over the complex numbers. (It 
is permissible for your parametrizations to fail to cover finitely many points of the 
curve.) 

(a) The unit circle x? + y? = 1; 
(b) The cuspidal cubic y? = 2°; 
(c) The nodal cubic y? = x? + 2°. 
7. Let f be an irreducible polynomial, and g an arbitrary polynomial, in k[z, y]. If f does 


not divide g, show that the system of simultaneous equations f(x,y) = g(x,y) = 0 has 
only finitely many solutions. 


8.4 The Nullstellensatz: Equivalent Versions 
And Proof 


We are now in position to establish the equivalence of several versions of the Nullstellen- 
satz. 


8.4.1 Theorem 


For any field & and any positive integer n, the following statements are equivalent. 


(1) Maximal Ideal Theorem The maximal ideals of k[X,,...,X,] are the ideals of 
the form (X1 — a1,..., Xn — Gn), @1,.-.,;@y € k. Thus maximal ideals correspond to 
points. 

(2) Weak Nullstellensatz If J is an ideal of k[Xj,...,Xn] and V(J) = 0, then J = 
k[X1,...,Xn]. Equivalently, if J is a proper ideal, then V(J) is not empty. 

(3) Nullstellensatz If J is an ideal of kLX1,..., X»], then 
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(4) k is algebraically closed. 


Proof. (1) implies (2). Let I be a proper ideal, and let J be a maximal ideal containing I. 
By (8.1.3), part (4), V(J) C V(J), so it suffices to show that V(J) is not empty. By (1), 
J has the form (X1 — a1,...,Xn— Gy). But then a = (a1,...,a@n) € V(J). [In fact 
V(J) = {a}-] 

(2) implies (3). This was done in Section 8.3. 

(3) implies (2). We use the fact that the radical of an ideal I is the intersection 
of all prime ideals containing J; see Section 8.3, Problem 2. Let J be a proper ideal of 
k[X1,...,Xn]. Then J is contained in a maximal, hence prime, ideal P. By the result just 
quoted, VI is also contained in P, hence VI is a proper ideal. By (3), [V(Z) is a proper 
ideal. But if V(Z) = 0, then by (8.1.3) part (7), IV(1) = k[X,..., Xn], a contradiction. 

(2) implies (1). If J is a maximal ideal, then by (2) there is a point a = (a1,...,@n) € 
V(I). Thus every f € I vanishes at a, in other words, I C I({a}). But (X1 —q,..., 
Xp, — Gn) = I({a}); to see this, decompose f € I({a}) as in the proof of (8.3.1). Therefore 
the maximal ideal J is contained in the maximal ideal (Xj — a1,...,Xn— Gn), and it 
follows that I = (Xj — a4,..., Xn — Gn). 

(4) implies (1). Let J be a maximal ideal of k[X,,...,X,], and let K = k[Xj,..., 
X,,|/I, a field containing an isomorphic copy of k viac > c+ I, cE k. By (8.3.9), Kk 
is a finite extension of k, so by (4), K = k. But then X; +I = a;+TJ for some a; € k, 
i=1,...,n. Therefore X; — a; is zero in k|X1,...,X»]/J, in other words, X; — a; € I. 
Consequently, I D> (X1 — a1,..., Xn — Gy), and we must have equality by (8.3.1). 

(1) implies (4). Let f be a nonconstant polynomial in k[X,] with no root in k. We 
can regard f is a polynomial in n variables with no root in A”. Let J be a maximal ideal 
containing the proper ideal (f). By (1), I is of the form (X1 — a1,..., Xn — an) = I({a}) 
for some a = (@1,...,@n) € A”. Therefore f vanishes at a, a contradiction. & 


8.4.2 Corollary 
If the ideals J and J define the same variety and a polynomial g belongs to one of the 


ideals, then some power of g belongs to the other ideal. 


Proof. If V(I) = V(J), then by the Nullstellensatz, VT = VJ. If g € I C VI, then g’ € J 
for some positive integer r. @& 


8.4.3 Corollary 


The maps V > I(V) and I — V(1J) set up a one-to-one correspondence between varieties 
and radical ideals (defined by I = V1). 


Proof. By (8.1.3) part 6, VI(V) = V. By the Nullstellensatz, IV(I) = VI = I for radical 
ideals. It remains to prove that for any variety V, I(V) is a radical ideal. If f” € I(V), 
then f", hence f, vanishes on V, so fe I(V). & 
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8.4.4 Corollary 


Let fi,..-,fr,g € k[X1,...,Xn], and assume that g vanishes wherever the f; all vanish. 
Then there are polynomials h1,...,h, € k[X1,..., X»] and a positive integer s such that 
G=hifit-:- thrfr. 


Proof. Let I be the ideal generated by f1,..., f». Then V(J) is the set of points at which 
all f; vanish, so that IV(Z) is the set of polynomials that vanish wherever all f; vanish. 
Thus g belongs to IV (I), which is VJ by the Nullstellensatz. Consequently, for some 
positive integer s, we have g* € J, and the result follows. d& 


Problems For Section 8.4 


1. Let f be a polynomial in k[X),...,X,], and assume that the factorization of f into 
irreducibles is f = f]'t--- f"". Show that the decomposition of the variety V(f) into 
irreducible subvarieties (Section 8.1, Problems 5 and 6) is given by V(f) = Uf_,V (fi). 

2. Under the hypothesis of Problem 1, show that IV(f) = (fi--: fr). 

3. Show that there is a one-to-one correspondence between irreducible polynomials in 
k[X1,...,Xp] and irreducible hypersurfaces (see (8.2.2))in A"(k), if polynomials that 
differ by a nonzero multiplicative constant are identified. 

4. For any collection of subsets X; of A", show that I(U;X;) = NiI(X;). 

5. Show that every radical ideal I of k[X1,...,X»] is the intersection of finitely many 
prime ideals. 

6. In Problem 5, show that the decomposition is unique, subject to the condition that 
the prime ideals P are minimal, that is, there is no prime ideal Q with I CQ C P. 

7. Suppose that X is a variety in A?, defined by equations f(x,y) =--- = fm(x,y) = 90, 
m > 2. Let g be the greatest common divisor of the f;. If g is constant, show that X 
is a finite set (possibly empty). 

8. Show that every variety in A? except for A? itself is the union of a finite set and an 
algebraic curve. 

9. Give an example of two distinct irreducible polynomials in k[|X,Y] with the same 
zero-set, and explain why this cannot happen if k is algebraically closed. 

10. Give an explicit example of the failure of a version of the Nullstellensatz in a non- 
algebraically closed field. 


8.5 Localization 


8.5.1 Geometric Motivation 


Suppose that V is an irreducible variety, so that I(V) is a prime ideal. A polynomial g 
will belong to I(V) if and only if it vanishes on V. If we are studying rational functions 
f/g in the neighborhood of a point « € V, we must have g(x) 4 0. It is very convenient 
to have every polynomial g ¢ I(V) available as a legal object, even though g may vanish 
at some points of V. The technical device that makes this possible is the construction of 
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the ring of fractions S~1R, the localization of R by S, where R = k[X1,..., Xp] and S is 
the multiplicative set R\ I(V). We will now study the localization process in general. 


8.5.2 Notation 


Recalling the setup of Section 2.8, let S be a multiplicative subset of the ring R, and 
S-'R the ring of fractions of R by S$. Let h be the natural homomorphism of R into 
SR, given by h(a) = a/1. If X is any subset of R, define S~-1X = {a/s: x € X,s € Sh. 
We will be especially interested in such a set when X is an ideal. 

If J and J are ideals of R, the product of I and J, denoted by IJ, is defined (as in 
(7.6.1)) as the set of all finite sums }°; viy;, 21 € I, ys € J. It follows from the definition 
that IJ is an ideal. The sum of two ideals has already been defined in (2.2.8). 


8.5.3 Lemma 
If I is an ideal of R, then S~'TJ is an ideal of S~!R. If J is another ideal of R, the 


@) S04 J) = S194 8-1; 
(ii) S~*(LJ) = (S“*D)(S7*); 
Gi Sans 5 Live. a 
(iv) S~'I is a proper ideal iff SOI = 0. 


Proof. The definition of addition and multiplication in S~!'R implies that S~'J is an 
ideal, and that in (i), (ii) and (iii), the left side is contained in the right side. The reverse 
inclusions in (i) and (ii) follow from 


Gi 5s tes ab ab 
s t st > st st’ 


To prove (iii), let a/s = b/t wherea € I, b € J, s,t € S. There exists u € S such that 
u(at — bs) = 0. Then a/s = uat/ust = ubs/ust € S7'(IN J). 

Finally, if s € SJ then 1/1 = s/s € S-'I, so S-1I = S-!R. Conversely, if 
S-'I = S“!R, then 1/1 = a/s for some a € I, s € S. There exists t € S such that 
t(s—a)=0,soat=stEeSnl. & 


Ideals in S~1R must be of a special form. 


8.5.4 Lemma 


If J is an ideal of S~'R and I=h71(J), then J is an ideal of Rand S~'I = J. 
Proof. I is an ideal by the basic properties of preimages of sets. Let a/s € S~'I, with 
aé€Iandse€S. Then a/1 € J, so a/s = (a/1)(1/s) € J. Conversely, let a/s € J, with 
aé R,s€S. Then h(a) =a/1 =(a/s)(s/1)€ J,soaelanda/seS'l. & 


Prime ideals yield sharper results. 


8.5. LOCALIZATION 15 


8.5.5 Lemma 


If I is any ideal of R, then I C h~1(S~'Z), with equality if I is prime and disjoint from 
S. 


Proof. If a € I, then h(a) = a/1 € S~'I. Thus assume that J is prime and disjoint from 
S, and let a€ h7'(S~'I). Then h(a) = a/1 € S~'I, so a/1 = b/s for some bE T, s € S. 
There exists t € 9 such that t(as —b) = 0. Thus ast = bt € I, with st ¢ I since SOI = 0. 
Since J is prime, we haveacl. & 


8.5.6 Lemma 
If I is a prime ideal of R disjoint from $, then S~'J is a prime ideal of S~!R. 


Proof. By (8.5.3), part (iv), S~1I is a proper ideal. Let (a/s)(b/t) = ab/st € S~'T, with 
a,b € R, s,t € S. Then ab/st = c/u for some c € I, u € S. There exists v € S such that 
v(abu — cst) = 0. Thus abuv = cstv € I, and uv ¢ I because SMI =f. Since I is prime, 
ab € I, hence a € I or b€ I. Therefore either a/s or b/t belongs to S~'I. & 


The sequence of lemmas can be assembled to give a precise conclusion. 


8.5.7 Theorem 


There is a one-to-one correspondence between prime ideals P of R that are disjoint from 
S and prime ideals Q of S~1R, given by 


PSP and Q— h7'(Q). 


Proof. By (8.5.4), S~1(h—1(Q)) = Q, and by (8.5.5), h-1(S-1P) = P. By (8.5.6), S-1P 
is a prime ideal, and h~!(Q) is a prime ideal by the basic properties of preimages of sets. 
If h~1(Q) meets S, then by (8.5.3) part (iv), Q = S~'(h-1(Q)) = S“1R, a contradiction. 
Thus the maps P — S~!P and Q — h~1(Q) are inverses of each other, and the result 
follows. @& 


8.5.8 Definitions and Comments 


If P is a prime ideal of R, then S = R\ P is a multiplicative set. In this case, we write 
R(P) for S~'R, and call it the localization of R at P. (The usual notation is Rp, but it’s 
easier to read without subscripts.) If J is an ideal of R, we write I(P) for S~'I. We are 
going to show that R(P) is a local ring, that is, a ring with a unique maximal ideal. First 
we give some conditions equivalent to the definition of a local ring. 


8.5.9 Proposition 


For a ring R, the following conditions are equivalent. 


(i) R is a local ring; 


(ii) There is a proper ideal J of R that contains all nonunits of R; 
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(iii) The set of nonunits of R is an ideal. 


Proof. (i) implies (ii). If a@ is a nonunit, then (a) is a proper ideal, hence is contained in 
the unique maximal ideal J. 

(ii) implies (iii). If @ and 6 are nonunits, so are a+ 6 and ra. If not, then J contains 
a unit, so J = R, a contradiction. 

(iii) implies (i). If J is the ideal of nonunits, then J is maximal, because any larger 
ideal J would have to contain a unit, so that J = R. If H is any proper ideal, then H 
cannot contain a unit, so H C I. Therefore J is the unique maximal ideal. & 


8.5.10 Theorem 
R(P) is a local ring. 


Proof. Let Q be a maximal ideal of R(P). Then Q is prime, so by (8.5.7), Q = I(P) for 
some prime ideal J of R that is disjoint from S$, in other words, contained in P. Thus 
Q =I1(P) C P(P). If P(P) = R(P), then by (8.5.3) part (iv), P is not disjoint from 
S = R\P, which is impossible. Therefore P(P) is a proper ideal containing every maximal 
ideal, so it must be the unique maximal ideal. & 


If R is an integral domain and S is the set of all nonzero elements of R, then S~!R is 
the quotient field of R. In this case, S~!R is a local ring, because any field is a local ring. 
({0} is the unique maximal ideal.) Alternatively, we can appeal to (8.5.10) with P = {0}. 


8.5.11 Localization of Modules 


If M is an R-module and S a multiplicative subset of R, we can essentially repeat the 
construction of Section 2.8 to form the localization S~!M of M by S, and thereby divide 
elements of M by elements of S. If x,y € M and s,t € S, we call (z,s) and (y,t) 
equivalent if for some u € S, u(ta — sy) = 0. The equivalence class of (x, s) is denoted by 
x/s, and addition is defined by 


x Yen oy 


t st 


If a/s € S~'R and x/t € S~'M, we define 


ax ax 


st st’ 


In this way, S~!M becomes an S~!R-module. Exactly as in (8.5.3), if M and N are 
submodules of a module L, then 


S-'(M+N)=S"'M+S'N and S7"'(MNN)=S'MnS"'N. 


Further properties will be given in the exercises. 
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Problems For Section 8.5 


1. Let M be a maximal ideal of R, and assume that for every x € M,1+4 2 is a unit. 
Show that R is a local ring (with maximal ideal MM). [Show that if « ¢ M, then x isa 
unit, and apply (8.5.9).] 

2. Show that if p is prime and n is a positive integer, then Zpn is a local ring with maximal 
ideal (p). 

3. Let R be the ring of all n by n matrices with coefficients in a field F’. If A is a nonzero 
element of R and 1 is the identity matrix, is {1, A, A?,... } always a multiplicative set? 


Let S be a multiplicative subset of the ring R. We are going to construct a mapping 
from R-modules to S~!R-modules, and another mapping from R-module homomorphisms 
to S~!R-module homomorphisms, as follows. If M is an R-module, we let M — S71M. 
If f: M — N is an R-module homomorphism, we define S~!f: S~1M — S71!N by 


x f(x) 


s s 
Since f is a homomorphism, so is S~! f. 


4. If g: N — L and composition of functions is written as a product, show that S~!(gf) = 
S~'(g)S~1(f), and if 1y7 is the identity mapping on M, then S~1(1)7) = 1g-1,4. We 
say that S~! is a functor from the category of R-modules to the category of S~!R- 
modules. This terminology will be explained in great detail in Chapter 10. 


5. If 
f g 
M—- N > L 
is an exact sequence, show that 


ce | S-1g 
StM 5 SIN =| §S*1L 


is exact. We say that S~! is an exact functor. Again, we will study this idea in 
Chapter 10. 

6. Let R be the ring of rational functions f/g with f,g € k[X1,...,Xn] and g(a) 4 0, 
where a = (a1,...,@p) is a fixed point in A”. Show that R is a local ring, and identify 
the unique maximal ideal. 

7. If M is an R-module and S is a multiplicative subset of R, denote S~!M by Msg. If 
N is a submodule of M, show that (M/N)s = Ms/Ng. 


8.6 Primary Decomposition 


We have seen that every radical ideal in k[X,,..., X,] can be expressed as an intersection 
of finitely many prime ideals (Section 8.4, Problem 5). A natural question is whether a 
similar result holds for arbitrary ideals. The answer is yes if we generalize from prime to 
primary ideals. 
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8.6.1 Definitions and Comments 


The ideal Q in the ring R is primary if Q is proper and whenever a product ab belongs to 
Q, either a € Q or b” € Q for some positive integer n. [The condition on b is equivalent 
to b€ /Q.| An equivalent statement is that R/Q 4 0 and whenever (a + Q)(b+Q) =0 
in R/Q, either a+ Q = 0 or (b+ Q)” = 0 for some positive integer n. This says that if 
b+ Q is a zero-divisor in R/Q, then it is nilpotent, that is, some power of b+ Q is 0. 

It follows from the definition that every prime ideal is primary. Also, if Q is primary, 
then /Q is the smallest prime ideal containing Q. [Since /Q is the intersection of all 
prime ideals containing Q (Section 8.3, Problem 2), it suffices to show that /Q is prime. 
But if ab” € Q, then a” € Q or b”” © Q for some m, so either a or b must belong to 
J/Q. Note also that since Q is proper, it is contained in a maximal, hence prime, ideal, 
so VQ is also proper.] 

If Q is primary and \/Q = P, we say that Q is P-primary. 


8.6.2 Examples 


1. In Z, the primary ideals are {0} and (p"), where p is prime. In Ze, 2 and 3 are 
zero-divisors that are not nilpotent, and a similar situation will occur in Z,, whenever 
more than one prime appears in the factorization of m. 

2. Let R = k[X,Y] where k is any field, and take Q = (X,Y°%), the ideal generated by 
X and Y?. This is a nice example of analysis in quotient rings. We are essentially setting 
X and Y® equal to zero, and this collapses the ring R down to polynomials ag+a,Y +a2Y?, 
with the a; € k and arithmetic mod Y°. Formally, R/Q is isomorphic to k[Y]/(Y°). The 
zero-divisors in R/Q are of the form cY + dY?, c € k, and they are nilpotent. Thus Q is 
primary. If f € R, then the only way for f not to belong to the radical of @ is for the 
constant term of f to be nonzero. Thus /Q = (X,Y), a maximal ideal by (8.3.1). 

Now we claim that @ cannot be a power of a prime ideal; this will be a consequence 
of the next result. 


8.6.3 Lemma 
If P is a prime ideal, then for every positive integer n, VP” = P. 


Proof. Since P is a prime ideal containing P”, VP" C P. If « € P, then x” € P”, so 
crEVvP". he 


Returning to Example 2 of (8.6.2), if Q = (X,Y?) is a prime power P”, then its radical 
is P, so P must be (X,Y). But X € Q and X ¢ P",n > 2; since Y belongs to P but 
not Q, we have reached a contradiction. 

After a preliminary definition, we will give a convenient sufficient condition for an 
ideal to be primary. 


8.6.4 Definition 


The nilradical N(R) of a ring R is the set of nilpotent elements of R, that is, {a € R: 
x” = 0 for some positive integer n}. Thus \V(R) is the radical of the zero ideal, which is 
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the intersection of all prime ideals of R. 


8.6.5 Proposition 


If the radical of the ideal @ is maximal, then Q is primary. 


Proof. Since \/Q is maximal, it must be the only prime ideal containing Q. By the 
correspondence theorem and the fact that the preimage of a prime ideal is a prime ideal 
(cf. (8.5.7)), R/Q has exactly one prime ideal, which must coincide with V(R/Q). Any 
element of R/Q that is not a unit generates a proper ideal, which is contained in a 
maximal ideal, which again must be V’(R/Q). Thus every element of R/Q is either a unit 
or nilpotent. Since a zero-divisor cannot be a unit, every zero-divisor of R/Q is nilpotent, 
so Q is primary. & 


8.6.6 Corollary 


If M is a maximal ideal, then M” is M-primary for all n = 1,2,.... 
Proof. By (8.6.3), the radical of MM” is M, and the result follows from (8.6.5). & 


Here is another useful property. 


8.6.7 Proposition 


If Q is a finite intersection of P-primary ideals Q;, 7 = 1,...,n, then @ is P-primary. 


Proof. First note that the radical of a finite intersection of ideals is the intersection of the 
radicals (see Problem 1). It follows that the radical of Q is P, and it remains to show 
that Q is primary. If ab € Q but a ¢ Q, then for some i we have a ¢ Q;. Since Q; is 
P-primary, b belongs to P = /Q;. But then some power of b belongs to Q. @& 


We are going to show that in a Noetherian ring, every proper ideal J has a primary 
decomposition, that is, J can be expressed as a finite intersection of primary ideals. 


8.6.8 Lemma 


Call an ideal I irreducible if for any ideals J and K, J = JK implies that J = J or 
I=K. If R is Noetherian, then every ideal of R is a finite intersection of irreducible 
ideals. 


Proof. Suppose that the collection S of all ideals that cannot be so expressed is nonempty. 
Since R is Noetherian, S has a maximal element J, necessarily reducible. Let [= JN Kk, 
where I is properly contained in both J and K. By maximality of J, the ideals J and K are 
finite intersections of irreducible ideals, and consequently so is J, contradicting ES. & 
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If we can show that every irreducible proper ideal is primary, we then have the desired 
primary decomposition. Let us focus on the chain of reasoning we will follow. If I is 
an irreducible proper ideal of R, then by the correspondence theorem, 0 is an irreducible 
ideal of the Noetherian ring R/I. If we can show that 0 is primary in R/J, then again by 
the correspondence theorem, J is primary in R. 


8.6.9 Primary Decomposition Theorem 


Every proper ideal in a Noetherian ring R has a primary decomposition. (We can drop 
the word “proper” if we regard R as the intersection of the empty collection of primary 
ideals.) 


Proof. By the above discussion, it suffices to show that if 0 is an irreducible ideal of R, 
then it is primary. Let ab = 0 with a # 0. Since R is Noetherian, the sequence of 
annihilators 


annb C annb? C annb? C--- 
stabilizes, so ann b” = annb”*! for some n. If we can show that 
(a) (b") =0 


we are finished, because a 4 0 and the zero ideal is irreducible (by hypothesis). Thus let 
x = ca = db” for some c,d € R. Then br = cab = db"*! = 0 (because ab = 0), so d 
annihilates 6°+!, hence d annihilates b”. Thus x = db” =0. & 


Problems For Section 8.6 
1. If ),...,I, are arbitrary ideals, show that 


2. Let I be the ideal (XY —Z?) in k[X, Y, Z], where k is any field, and let R = k[X,Y, Z]/I. 
If P is the ideal (X + I, Z + I), show that P is prime. 

3. Continuing Problem 2, show that P?, whose radical is prime by (8.6.3) and which is a 
power of a prime, is nevertheless not primary. 

4. Let R= k[X,Y], and let P; = (X), Po = (X,Y), Q = (X,Y). Show that P, is prime 
and P? and Q are Ps-primary. 

5. Continuing Problem 4, let I = (X?, XY). Show that P, N P? and P,N Q are both 
primary decompositions of I. 


Notice that the radicals of the components of the primary decomposition (referred to 
as the primes associated with I) are P, and P» in both cases. [P, is prime, so VP, = P,; 
P, C JQ and P, is maximal, so Py = VQ;] Uniqueness questions involving primary 
decompositions are treated in detail in textbooks on commutative algebra. 
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6. We have seen in Problem 5 of Section 8.4 that every radical ideal in R = k[Xy,..., Xn] 
is the intersection of finitely many prime ideals. Show that this result holds in an 
arbitrary Noetherian ring R. 


7. Let R = k[X,Y] and let I, be the ideal (X°, XY,Y"). Show that for every positive 
integer n, I, is a primary ideal of R. 


8.7 Tensor Product of Modules Over a Commutative 
Ring 


8.7.1 Motivation 


In many areas of algebra and its applications, it is useful to multiply, in a sensible way, an 
element x of an R-module M by an element y of an R-module N. In group representation 
theory, M and WN are free modules, in fact finite-dimensional vector spaces, with bases 
{z;} and {y;}. Thus if we specify that multiplication is linear in each variable, then we 
need only specify products of x; and y;. We require that the these products, to be denoted 
by x; ® y;, form a basis for a new R-module T. 


If f: R — S is a ring homomorphism and M is an S-module, then M becomes an 
R-module via rx = f(r)z, r € R, x € M. This is known as restriction of scalars. 
In algebraic topology and algebraic number theory, it is often desirable to reverse this 
process. If M is an R-module, we want to extend the given multiplication rz, r € R, 
x € M, to multiplication of an arbitrary s € S by « € M. This is known as extension of 
scalars, and it becomes possible with the aid of the tensor product construction. 


The tensor product arises in algebraic geometry in the following way. Let M be the 
coordinate ring of a variety V in affine space A™, in other words, M is the set of all 
polynomial functions from V to the base field k. Let N be the coordinate ring of the 
variety W in A”. Then the cartesian product V x W is a variety in A”*”, and its 
coordinate ring turns out to be the tensor product of M and N. 


Let’s return to the first example above, where M and WN are free modules with bases 
{x;} and {y;}. Suppose that f is a bilinear map from M x N to an R-module P. (In other 
words, f is R-linear in each variable.) Information about f can be completely encoded 
into a function g of one variable, where g is an R-module homomorphism from T to P. 
We take g(x; ® yj) = f(xi,y;) and extend by linearity. Thus f is the composition of 
the bilinear map h from M x N to T specified by (;,y;) — 2; ® yj, followed by g. To 
summarize: 


Every bilinear mapping on M x N can be factored through T. 


The R-module T is called the tensor product of M and N, and we write T= M®RrN. 
We are going to construct a tensor product of arbitrary modules over a commutative ring, 
and sketch the generalization to noncommutative rings. 
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8.7.2 Definitions and Comments 


Let M and N be arbitrary R-modules, and let F' be a free R-module with basis M x N. 
Let G be the submodule of F' generated by the “relations” 


(x x’, y) (x,y) (25a); Gata = ea (e,9) 
(rz,y) —r(2,y); (a, ry) —r(x,y) 


where x, 2’ € M, y,y’ € N,r © R. Define the tensor product of M and N (over R) as 
T=M®rN=F/G 


and denote the element (x,y) +G of T by x@y. Thus the general element of T is a finite 
sum of the form 


t=) Oy (1) 


with «; € M and y; € N. It is important to note that the representation (1) is not 
necessarily unique. 
The relations force x © y to be linear in each variable, so that 


r@(yty)=r@y+rey, (+2) @y=reytsa ey, (2) 
r(e@®@y)=rx @y=xLOry. (3) 


See Problem 1 for details. Now if f is a bilinear mapping from M x N to the R-module 
P, then f extends uniquely to a homomorphism from F' to P, also called f. Bilinearity 
means that the kernel of f contains G, so by the factor theorem, there is a unique R- 
homomorphism g: T — P such that g(a ® y) = f(x,y) for alla ¢e M, y € N. As in 
(8.7.1), if we compose the bilinear map h: (z,y) > « ® y with g, the result is f. Again, 
we say that 


Every bilinear mapping on M x N can be factored through T. 


We have emphasized this sentence, known as a universal mapping property (abbrevi- 
ated UMP), because along with equations (1), (2) and (3), it indicates how the tensor 
product is applied in practice. The detailed construction we have just gone through can 
now be forgotten. In fact any two R-modules that satisfy the universal mapping property 
are isomorphic. The precise statement and proof of this result will be developed in the 
exercises. 

In a similar fashion, using multilinear rather than bilinear maps, we can define the 
tensor product of any finite number of R-modules. [In physics and differential geometry, 
a tensor is a multilinear map on a product M, x--- x M,, where each M; is either a finite- 
dimensional vector space V or its dual space V*. This suggests where the terminology 
“tensor product” comes from.] 

In the discussion to follow, M, N and P are R-modules. The ring R is assumed fixed, 
and we will usually write © rather than @p. 
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8.7.3. Proposition 
MENZFN@M. 


Proof. Define a bilinear mapping f: Mx N > N®@M by f(a2,y) = y®@a. By the UMP, 
there is a homomorphism g: M @® N > N ® M such that g(a ® y) = y® a. Similarly, 
there is a homomorphism g’: N ® M > M®@WN with g/(y®xz) = «@y. Thus g is an 
isomorphism (with inverse g’). d& 


8.7.4 Proposition 


M®(N®P)=(ME@N)OP. 


Proof. Define f: Mx N x P = (M@N)@P by f(a,y,z) = (x @y) @z. The UMP 
produces g: M x (N® P) > (MV @N) ® P with g((z, (y ® z))) = (4 @ y) ® z. [We are 
applying the UMP for each fixed « € M, and assembling the maps to produce g.] Since g 
is bilinear (by Equations (2) and (3)), the UMP yields h: M@(N@®P)—-(M®@N)@P 
with h(x ® (y ® z)) = (@ @ y) ® z. Exactly as in (8.7.3), we can construct the inverse of 
h, so h is the desired isomorphism. & 


8.7.5 Proposition 


ME(N@P)=(MO@N)E(M OP). 


Proof. Let f be an arbitrary bilinear mapping from M x (N @ P) to Q. If a € M, 
y € N, z € P, then f(z,y +z) = f(x,y) + f(a,z). The UMP gives homomorphisms 
gi: M@N > Qand go: M@P > Q such that g)(a@y) = f(x,y) and go(x@z) = f(a, z). 
The maps g; and gg combine to give g: (M @ N) 6(M @ P) = Q such that 


g((@ @ y) + (2" ®@ z)) = (@ @y) + go(2" @ 2). 
In particular, with 2’ = 2, 
g(x @y) + (e@@z)) = fla,y+ 2), 
so ifh: Mx (N@P) > M @(N @ P) is defined by 
h(z,y+z) =(e«#@y)+ (rz), 


then f = gh. Thus (M @ N) @ (M ®@ P) satisfies the universal mapping property, hence 
must be isomorphic to the tensor product. d& 


8.7.6 Proposition 


Regarding R as a module over itself, R®p M = M. 


Proof. The map (r,2) — ra of Rx M — M is bilinear, so there is a homomorphism 
g: R® M—M< such that g(r ® x) = ra. Define hh: M > R®M by h(x) =1@2. Then 
h(rz) =1@ra=rl®@®x=r za. Thus g is an isomorphism (with inverse h). d& 
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8.7.7 Corollary 


Let R™ be the direct sum of m copies of R, and M™ the direct sum of m copies of M. 
Then R” @M2=M™. 


Proof. By (8.7.5), R™ ®@ M is isomorphic to the direct sum of m copies of R ® M, which 
is isomorphic to M™ by (8.7.6). & 


8.7.8 Proposition 


R”™ @R" = R™ . Moreover, if {21,...,2,} is a basis for R™ and {y1,...,Yn} is a basis 
for .A”, then {7 @ y;,7 = 1,3...) j =1, +0147} is a basis for AR”. 


Proof. This follows from the discussion in (8.7.1). [The first assertion can also be proved 
by taking M = R” in (8.7.7).] & 


8.7.9 Tensor Product of Homomorphisms 


Let fi: Mi — N, and fo: Mz — No be R-module homomorphisms. The map (21,22) > 
fi(a1) ® fo(x2) of My x Mo into N; ® Nz is bilinear, and induces a unique f: M1 ® M2 — 
N; ® No such that 


f(%1 ® £2) = fi(%1) @ fo(@2), 1 © My, @2 © Mp. 


We write f = f1® fa, and call it the tensor product of f; and fo. Similarly, if g1: Ni — Pi 
and gg: No — P2, then we can compose gj ® gz with f; ® fo, and 


(91 © g2)(fi @ fo)(%¥1 @ 2) = gi fi(x1) ® g2fo(x2), 


hence 


(91 ® gz) ° (fi @ fa) = (91 © fi) @ (g2 9 fa). 


When M, = N, = V, a free R-module of rank m, and Mz = No = W, a free R- 
module of rank n, there is a very concrete interpretation of the tensor product of the 
endomorphisms f: V > V and g: W > W. If f is represented by the matrix A and g by 
the matrix B, then the action of f and g on basis elements is given by 


F(vj) = So agri, g(wr) = So darwe 
i k 
where 7 and j range from 1 to m, and k and / range from 1 to n. Thus 
(f ® g)(vj ® wi) = f(vj) ® g(wi) = oo aij det (Vi ®@ We): 
i,k 


The mn by mn matrix representing the endomorphism f®g: V@W — V @W is denoted 
by A® B and called the tensor product or Kronecker product of A and B. It is given by 


[ onB aygB OimB ) 
A@B=| : 
lee am2B ers 18 
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The ordering of the basis of V © W is 
V1 ® W1,.--,V1 ® Wn,---,Um ® W1,---,Um ® Wn- 


To determine the column of A ® B corresponding to vj; ® w, locate the a;;B block 
(i =1,...,m; j fixed) and proceed to column | of B. As we move down this column, the 
indices i and k vary according to the above ordering of basis elements. If this road map 
is not clear, perhaps writing out the entire matrix for m = 2 and n = 3 will help. 


Problems For Section 8.7 
1. Verify Equations (2) and (3) of (8.7.2). 
2. If m and n are relatively prime, show that Z,, ®z Z, = 0. 


3. If A is a finite abelian group and Q is the additive group of rationals, show that 
A ®z Q=0. Generalize to a wider class of abelian groups A. 

4. The definition of M ®p N via a universal mapping property is as follows. The tensor 
product is an R-module T along with a bilinear map h: M x N — T such that given 
any bilinear map f: M x N — P, there is a unique R-homomorphism g: T — P such 
that f = gh. See the diagram below. 


MxN“=+T 


sy 


P 


Now suppose that another R-module T’, along with a bilinear mapping h’: M x N => 
T’, satisfies the universal mapping property. Using the above diagram with P = T’ 
and f replaced by h’, we get a unique homomorphism g: T — T”’ such that h’ = gh. 
Reversing the roles of T and T’, we get g’: T’ > T such that h = g'h’. 

Show that T and T’ are isomorphic. 

5. Consider the element n@ in Z@Z,,, where x is any element of Z, and we are tensoring 
over Z, ie., R= Z. Show that n® a= 0. 

6. Continuing Problem 5, take x # 0 and regard n ® x as an element of nZ ® Z,, rather 
than Z ® Z,. Show that n®a #4 0. 

7. Let M,N, M’,N’ be arbitrary R-modules, where R is a commutative ring. Show that 
the tensor product of homomorphisms induces a linear map from Hompr(M, M’) @r 
Homa,(QN, N') to Hompr(M rN, M'®r N’). 

8. Let M be a free R-module of rank m, and N a free R-module of rank n. Show that 
there is an R-module isomorphism of Endr(M) ®@r Endr(N) and Endr(M ®@ N). 


8.8 General Tensor Products 


We now consider tensor products of modules over noncommutative rings. A natural 
question is “Why not simply repeat the construction of (8.7.2) for an arbitrary ring R?”. 
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But this construction forces 

rz @ sy =7r(x#® sy) =rs(x@ y) 
and 

rx @ sy = s(ra @y) = sr(x@y) 


which cannot hold in general if R is noncommutative. A solution is to modify the con- 
struction so that the tensor product T is only an abelian group. Later we can investigate 
conditions under which T has a module structure as well. 


8.8.1 Definitions and Comments 


Let M be aright R-module and N a left R-module. (We often abbreviate this as Mr and 
RN.) Let f: M x N > P, where P is an abelian group. The map f is biadditive if it is 
additive in each variable, that is, f(e+2’,y) = f(z,y)+f(2',y) and f(z, y+y’) = f(z, y)+ 
f(x,y’) for all x, a’ € M, y,y’ € N. The map f is R-balanced if f(xr,y) = f(a,ry) for all 
xe M,yeN,reéR. As before, the key idea is the universal mapping property: Every 
biadditive, R-balanced map can be factored through the tensor product. 


8.8.2 Construction of the General Tensor Product 


If Mp and RN, let F be the free abelian group with basis M x N. Let G be the subgroup 
of R generated by the relations 


(x+a',y) — (x,y) — (2,y); 
(z,y+y') — (zy) — (x,y); 
(zr, y) _ (x, ry) 
where v,2’ € M, y,y’ € N,r © R. Define the tensor product of M and N over R as 
T=M®pN=F/G 


and denote the element (x,y) -+G of T by x@ y. Thus the general element of T is a finite 
sum of the form 


t=) ai @ my. (1) 


The relations force the map h: (1,y) — «® y of M x N into T to be biadditive and 
R-balanced, so that 


r@(yty)=rc@yt+r@y’, (c«+2')®y=rc@yt+2' @y, (2) 
zr Q@y=xLOry. (3) 


If f is a biadditive, R-balanced mapping from M x N to the abelian group P, then f 
extends uniquely to an abelian group homomorphism from F' to P, also called f. Since f 
is biadditive and R-balanced, the kernel of f contains G, so by the factor theorem there 
is a unique abelian group homomorphism g: T — P such that g(x ® y) = f(x,y) for all 
xe€M,ye€EN. Consequently, gh = f and we have the universal mapping property: 
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Every biadditive, R-balanced mapping on M x N can be factored through T. 


As before, any two abelian groups that satisfy the universal mapping property are 
isomorphic. 


8.8.3. Bimodules 


Let R and S be arbitrary rings. We say that M is an S — R bimodule if M is both a left 
S-module and a right R-module, and in addition a compatibility condition is satisfied: 
(sx)r = s(xr) for all s € S, r € R. We often abbreviate this as 5 Mr. 

If f: R — S is a ring homomorphism, then S' is a left S-module, and also a right 
R-module by restriction of scalars, as in (8.7.1). The compatibility condition is satisfied: 
(sx)r = sxf(r) = s(ar). Therefore S$ is an S — R bimodule. 


8.8.4 Proposition 
If >Mp and rNr, then M @R N is an S —T bimodule. 


Proof. Fix s € S. The map (2,y) — sx @y of M x N into M @p N is biadditive and 
R-balanced. The latter holds because by the compatibility condition in the bimodule 
property of M, along with (3) of (8.8.2), 


s(ar) @y = (sx)r@y=sx@ry. 


Thus there is an abelian group endomorphism on M @p N such that x @y — sx @ y, 
and we use this to define scalar multiplication on the left by s. A symmetrical argument 
yields scalar multiplication on the right by t. To check the compatibility condition, 


[s(x @ y)|t = (sx ® y)t = sx @ yt = s(x @ yt) =sl(x@y)t]. hw 


8.8.5 Corollary 


If sMp and RN, then M ®pR N is a left S-module. If Mr and rNr, then M @R N isa 
right T-module. 


Proof. The point is that every module is, in particular, an abelian group, hence a Z 
module. Thus for the first statement, take T = Z in (8.8.4), and for the second statement, 
take S=Z. & 


8.8.6 Extensions 


As in Section 8.7, we can define the tensor product of any finite number of modules using 
multiadditive maps (additive in each variable) that are balanced. For example, suppose 
that Mr, rNg and gP. If f: Mx Nx P — G, where G is an abelian group, the condition 
of balance is 


flar,y, 2) =. F(z, ry, 2) and F(z, ys, z) = f(x,y, 82) 
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forallae M, ye N,2z¢€P,reéR,s€S. An argument similar to the proof of (8.7.4) 
shows that 

(a) M@rN®sP=(M @RN)@sP=M OR(N Bs P). 

If M is a right R-module, and N and P are left R-modules, then 

(b) M@r(N@P)2=(M @RrN)G(M @rP). 

This is proved as in (8.7.5), in fact the result can be extended to the direct sum of an 
arbitrary (not necessarily finite) number of left R-modules. 

If M is a left R-module, then exactly as in (8.7.6) and (8.7.7), we have 

(c) R®r M=M and 

(d) R°@M2=M™. 

Let My, and M2 be right R-modules, and let N; and N»2 be left R-modules. If 
fi: M, — Ny, and fo: My — No are R-module homomorphisms, the tensor product 
fi ® fe can be defined exactly as in (8.7.9). As before, the key property is 


(fi ® f2)(1 ® 2) = fi(x1) @ fa(xe) 


for all x, € My, x2 € Mo. 


8.8.7 Tensor Product of Algebras 


If A and B are algebras over the commutative ring R, then the tensor product A @r B 
becomes an R-algebra if we define multiplication appropriately. Consider the map of 
Ax Bx Ax B into A®p B given by 


(a,b, a’,b') = aa’ ® bb’, a,a’ € A, b,b' € B. 


The map is 4-linear, so it factors through the tensor product to give an R-module homo- 
morphism g: A® B® A® B- A®B such that 


gia®bea' @v)=aa' ® bi’. 
Now let h: (A® B) x (A® B) ~A®B@A®OB be the bilinear map given by 
h(u,v) =uU®v. 


If we apply h followed by g, the result is a bilinear map f: (A® B) x (A@B)-~ A@B 
with 


f(a@b,a QU) =aad' @ bv’, 


and this defines our multiplication (a@ b)(a’ @b’) on A® B. The multiplicative identity is 
14 ® 18, and the distributive laws can be checked routinely. Thus A ®p B is a ring that 
is also an R-module. To check the compatibility condition, note that if r € R, a,a’ € A, 
b, bv! € B, then 


r[(a@ da’ @b)] = [ra@d)|(@' ev) = (@e@d)[r(a ov); 


all three of these expressions coincide with raa’ @ bb’ = aa’ @ rob’. 
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Problems For Section 8.8 


We will use the tensor product to define the exterior algebra of an R-module M, where R 
is acommutative ring. If pis a positive integer, we form the tensor product M®pr---®RrM 
of M with itself p times, denoted by M®?. Let N be the submodule of M®? generated 
by those elements x1 ® +--+ @ @p, with x; € M for all 7, such that x; = x; for some i F j. 
The p*” exterior power of M is defined as 


A? M = M®?/N. 


In most applications, M is a free R-module with a finite basis 71,...,2 (withl1 <p<n), 
and we will only consider this case. To simplify the notation, we write the element 
a®b@®---@c+N of APM as ab---c. (The usual notation is aA bA--- Ac.) 


1. Let yi,..-,Yp € M. Show that if y; and y; are interchanged in the product yj --- Yp, 
then the product is multiplied by —1. 

2. Show that the products xj, ---x;,, where i; < +++ <%p, span A?M. 

3. Let f: M? — Q be a multilinear map from M? to the R-module Q, and assume that f 
is alternating, that is, f(mi,...,mp) = 0 if m; = m, for some i # j. Show that f 
can be factored through A?M, in other words, there is a unique R-homomorphism 
g: APM — Q such that g(y1---Y¥p) = f(yi,---1¥p). 


Let y; = ei aijtj,i=1,...,n. Since {x1,...,@n} is a basis for M, y; can be iden- 
tified with row i of A. By the basic properties of determinants, the map f(y1,..-,Yn) = 
det A is multilinear and alternating, and f(21,...,2) = 1, the determinant of the identity 
matrix. 


4. Show that 21 ---a, #0in A"M, and that {11 ---x,} is a basis for A"M. 


Let I = {i1,:+- ,ip}, where 4; <--- <%p, and write the product 2;,---a;, as zz. Let 
J be the complementary set of indices. (For example, ifn = 5, p = 3, and I = {1, 2,4}, 
then J = {3,5}.) Any equation involving x; € A?M can be multiplied by x; to produce 
a valid equation in A"M. 


5. Show that the products x; of Problem 2 are linearly independent, so that A?M is a 
free R-module of rank (; ). 


Roughly speaking, the exterior algebra of M consists of the A?M for all p. By con- 
struction, A‘!M = M and A?M = 0 for p > n, since some index must repeat in any 
element of A?M. By convention, we take A°M = R. Formally, the exterior powers are 
assembled into a graded R-algebra 


Ap BA, GAQEG::: 


where A, = A?M. Multiplication is defined as in the discussion after Problem 4, that 
is, if y1---Yp € Ap and 2 ---zq € Ag, then the exterior product yy +++ Yp%1 +++ %q belongs 
to Ap+q: 

A ring R is said to be graded if, as an abelian group, it is the direct sum of sub- 
groups R,, n = 0,1,2,..., with RmRyz C Rnim for all m,n > 0. [Example: R = 
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k[X1,..-,Xn], Rn = all homogeneous polynomials of degree n.] By definition, Ro is 

a subring of R (because RoRp C Ro), and each R,, is a module over Ro (because 

RoRn C Rn). 

6. Suppose that the ideal J = @n>1Rn is generated over R by finitely many elements 
X,...,2,, with a; € R,,. Show that R, C S = Rol[ai,...,x,] for all n = 0,1,..., so 
that R= S. 


7. Show that R is a Noetherian ring if and only if Ro is Noetherian and R is a finitely 
generated Ro-algebra. 


Chapter - 9 


> Introducing to the 
Noncommutative Algebra 


Chapter 9 


Introducing Noncommutative 
Algebra 


We will discuss noncommutative rings and their modules, concentrating on two fundamen- 
tal results, the Wedderburn structure theorem and Maschke’s theorem. Further insight 
into the structure of rings will be provided by the Jacobson radical. 


9.1 Semisimple Modules 


A vector space is the direct sum of one-dimensional subspaces (each subspace consists of 
scalar multiples of a basis vector). A one-dimensional space is simple in the sense that 
it does not have a nontrivial proper subspace. Thus any vector space is a direct sum of 
simple subspaces. We examine those modules which behave in a similar manner. 


9.1.1 Definition 
An R-module M is simple if M #0 and the only submodules of M are 0 and M. 


9.1.2 Theorem 


Let M be a nonzero R-module. The following conditions are equivalent, and a module 

satisfying them is said to be semisimple or completely reducible. 

(a) M is a sum of simple modules; 

(b) M is a direct sum of simple modules; 

(c) If N is a submodule of M, then N is a direct summand of M, that is, there is a 
submodule N’ of M such that M=N@N’. 


Proof. (a) implies (b). Let M be the sum of simple modules Mj, i € I. If J C I, denote 
ics Mj by M(J). By Zorn’s lemma, there is a maximal subset J of J such that the 
sum defining N = M(JJ) is direct. We will show that M = N. First assume that i € J. 


1 
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Then N/M M; is a submodule of the simple module M;, so it must be either 0 or M;. If 
NOM; = 0, then M(JU{i}) is direct, contradicting maximality of J. Thus NOM; = Mi, 
so M; C N. But if 2 € J, then M; C N by definition of N. Therefore M; © N for all 2, 
and since M is the sum of all the M;, we have M = N. 

(b) implies (c). This is essentially the same as (a) implies (b). Let N be a submodule 
of M, where M is the direct sum of simple modules M;, i € IJ. Let J be a maximal 
subset of IJ such that the sum N + M(J) is direct. If i ¢ J then exactly as before, 
M,N(N 8 M(J)) = Mi, so M; C N® M(J). This holds for i € J as well, by definition of 
M(J). It follows that M = N @ M(J). [Notice that the complementary submodule N’ 
can be taken as a direct sum of some of the original Mj.] 

(c) implies (a). First we make several observations. 

(1) If M satisfies (c), so does every submodule N. [Let N < M, so that M= NON’. 
If V is a submodule of N, hence of M, we have M=V OW. IfxeEN, then x =v+u, 
vé€V,weW, sow = 2-0 € N (using V < N). But v also belongs to N, and 
consequently N = (NOV) @®(NOW)=Ve(NNW),.] 

(2) If D= A®BOC, then A= (A+ B)N(A+C). [Ifa+b=a' +c, where a,a’ € A, 
be B,ce€C, then a’ —a=b-—c, and since D is a direct sum, we have b = c = 0 and 
a=a’. Thuisa+be€EA\ 

(3) If N is a nonzero submodule of M, then N contains a simple submodule. 

[Choose a nonzero x € N. By Zorn’s lemma, there is a maximal submodule V of N 
such that « ¢ V. By (1) we can write N= V @V’, and V’ 4 0 by choice of w and V. 
If V’ is simple, we are finished, so assume the contrary. Then V’ contains a nontrivial 
proper submodule Vj, so by (1) we have V’ = V; @ V2 with the V; nonzero. By (2), 
V=(V4+Vi)N(V+V,). Since x ¢ V, either x € V+V or x € V+ Va, which contradicts 
the maximality of V.] 

To prove that (c) implies (a), let N be the sum of all simple submodules of M. By (c) 
we can write M = NQ@N’. If N’ £0, then by (3), N’ contains a simple submodule V. 
But then V < N by definition of N. Thus V < NM N’ = 0, a contradiction. Therefore 
N'’=OandM=N. & 


9.1.3 Proposition 


Nonzero submodules and quotient modules of a semisimple module are semisimple. 


Proof. The submodule case follows from (1) of the proof of (9.1.2). Let N < M, where 
M =)>, M; with the M; simple. Applying the canonical map from M to M/N, we have 


M/N = 5°(M,+ N)/N. 


a 


This key idea has come up before; see the proofs of (1.4.4) and (4.2.3). By the second 
isomorphism theorem, (M; + N)/N is isomorphic to a quotient of the simple module M;. 
But a quotient of M; is isomorphic to M; or to zero, and it follows that M/N is a sum of 
simple modules. By (a) of (9.1.2), M/N is semisimple. d& 
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Problems For Section 9.1 


1. Regard a ring R as an R-module. Show that R is simple if and only if R is a division 
ring. 

2. Let M be an R-module, with x a nonzero element of M. Define the R-module homo- 
morphism f: R — Ra by f(r) = rv. Show that the kernel I of f is a proper ideal 
of R, and R/T is isomorphic to Ra. 


3. If M is a nonzero R-module, show that M is simple if and only if M & R/T for some 
maximal left ideal I. 

4, If M is a nonzero R-module, show that M is simple if and only if M is cyclic (that 
is, M can be generated by a single element) and every nonzero element of M is a 
generator. 

5. What do simple Z-modules look like? 

6. If F is a field, what do simple F'[X]-modules look like? 


7. Let V be an n-dimensional vector space over a field k. (Take n > 1 so that V ¥ 0.) 
If f is an endomorphism (that is, a linear transformation) of V and x € V, define 
fa = f(a). This makes V into a module over the endomorphism ring End;(V). Show 
that the module is simple. 


8. Show that a nonzero module M is semisimple if and only if every short exact sequence 
0—- N-— M > P — 0 splits. 


9.2 Two Key Theorems 


If M is a simple R-module, there are strong restrictions on a homomorphism either into 
or out of M. A homomorphism from one simple R-module to another is very severely 
restricted, as Schur’s lemma reveals. This very useful result will be important in the proof 
of Wedderburn’s structure theorem. Another result that will be needed is a theorem of 
Jacobson that gives some conditions under which a module homomorphism f amounts to 
multiplication by a fixed element of a ring, at least on part of the domain of f. 


9.2.1 Schur’s Lemma 


(a) If f € Homy(M, N) where M and N are simple R-modules, then f is either identically 
0 or an isomorphism. 


(b) If M is a simple R-module, then Endr(M) is a division ring. 


Proof. (a) The kernel of f is either 0 or M, and the image of f is either 0 or N. If f is 
not the zero map, then the kernel is 0 and the image is N, so f is an isomorphism. 

(b) Let f € Endp(M), f not identically 0. By (a), f is an isomorphism, and therefore 
is invertible in the endomorphism ring of M. & 


The next result prepares for Jacobson’s theorem. 
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9.2.2 Lemma 


Let M be a semisimple R-module, and let A be the endomorphism ring Endr(M). [Note 
that M is an A-module; if g € A we take gexv = g(x), x € M.] If m € M and 
f € End,4(M), then there exists r € R such that f(m) = rm. 

Before proving the lemma, let’s look more carefully at End4(M/). Suppose that f € 
End4(M) and « € M. If g € A then f(g(x)) = g(f(x)). Thus End,(M) consists of 
those abelian group endomorphisms of M that commute with everything in Endra(M). In 
turn, by the requirement that f(r) = rf(a#), Endr(M) consists of those abelian group 
endomorphisms of M that commute with R, more precisely with multiplication by r, for 
each r € R. For this reason, End4(M) is sometimes called the double centralizer of R. 

We also observe that the map taking r € R to multiplication by r is a ring homomor- 
phism of R into Endy(M). [Again use rf(x) = f(rx).] Jacobson’s theorem will imply 
that given any f in End,4(M) and any finite set S C M, some g in the image of this 
ring homomorphism will agree with f on S. Thus in (9.2.2), we can replace the single 
element m by an arbitrary finite subset of M. 


Proof. By (9.1.2) part (c), we can express M as a direct sum Rm @ N. Now if we have a 
direct sum U=V @W andu=v+u, vE V, we W, there is a natural projection of U 
on V, namely u — v. In the present case, let 7 be the natural projection of M on Rm. 
Then 7 € A and f(m) = f(mm) = 7f(m) € Rm. The result follows. & 


Before proving Jacobson’s theorem, we review some ideas that were introduced in the 
exercises in Section 4.4. 


9.2.3. Comments 


To specify an R-module homomorphism ~ from a direct sum V* = @7_,Vj to a direct sum 
W* = 6™,Wi, we must give, for every i and j, the i” component of the image of v; € Vj. 
Thus the homomorphism is described by a matrix [1;;], where w;; is a homomorphism from 
V; to W;. The i*” component of 7)(v;) is Wi;(v,;), so the i“ component of (v1 +--+: + Un) 
is 7-1 Viz (vj). Consequently, 


W(v1 +++ + Un) = [Wag] |: | - (1) 


Un 


This gives an abelian group isomorphism between Homr(V*,W*) and [HomR(V;, W;)], 
the collection of all m by n matrices whose 77 entry is an R-module homomorphism from 
V; to W;. If we take m = n and V; = W; = V for all 7 and j, then V* = W* = V”, the 
direct sum of n copies of V. Then the abelian group isomorphism given by (1) becomes 


Endr(V") = M,,(Endr(V)), (2) 


the collection of all n by n matrices whose entries areR-endomorphisms of V. Since 
composition of endomorphisms corresponds to multiplication of matrices, (2) gives a ring 
isomorphism as well. 
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9.2.4 Theorem (Jacobson) 


Let M be a semisimple R-module, and let A be the endomorphism ring Endr(M). If 
f € End,g(M) and m,...,m, € M, then there exists r € R such that f(m;) = rm, for 
alli=1,...,n. 


Proof. f induces an endomorphism f() of M”, the direct sum of n copies of M, via 
FO (ny +--+ Mm) = Frm) + + Flmn) 


where f(m;) belongs to the i” copy of M. Thus the matrix that represents f(” is the 
scalar matrix fI, where J is an n by n identity matrix. If B = Endr(M"), then since 
a scalar matrix commutes with everything, f(™ € Endg(M"). If mj,...,mn € M, then 
by (9.2.2), there exists r € R such that f™(m, +-:-mn) = r(m, +---mp). [Note that 
since M is semisimple, so is M".] This is equivalent to f(m;) = rm, for alli. & 


Before giving a corollary, we must mention that the standard results that every vector 
space over a field has a basis, and any two bases have the same cardinality, carry over if 
the field is replaced by a division ring. Also recall that a module is said to be faithful if 
its annihilator is 0. 


9.2.5 Corollary 


Let M be a faithful, simple R-module, and let D = Endr(M), a division ring by (9.2.1(b)). 
If M is a finite-dimensional vector space over D, then Endp(M) & R, a ring isomorphism. 


Proof. Let {#1,...,%,} be a basis for M over D. By (9.2.4), if f € Endp(M), there 
exists r € R such that f(x;) = ra; for alli = 1,...,n. Since the x; form a basis, 
we have f(a) = ra for every x € M. Thus the map h from R to Endp(M) given by 
r — gr = multiplication by r is surjective. If rz = 0 for all « € M, then since M is 
faithful, we have r = 0 and h is injective. Since h(rs) = g, 0 gs = h(r)h(s), h is a ring 
isomorphism. d& 


Problems For Section 9.2 


1. Criticize the following argument. Let M be a simple R-module, and let A = Endr(M). 
“Obviously” M is also a simple A-module. For any additive subgroup N of M that 
is closed under the application of all R-endomorphisms of M is, in particular, closed 
under multiplication by an element r € R. Thus N is an R-submodule of M, hence 
is 0 or M. 


2. Let M be a nonzero cyclic module. Show that M is simple if and only if ann M, the 
annihilator of M/, is a maximal left ideal. 


3. In Problem 2, show that the hypothesis that M is cyclic is essential. 


4. Let V = F” be the n-dimensional vector space of all n-tuples with components in the 
field F. If T is a linear transformation on V, then V becomes an F'[X]-module via 
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f(X)v = f(T)v. For example, if n = 2, T(a,b) = (0, a), and f(X) = a9 +a,X +--+ 
anX”, then 


f(X)(1,0) = ap(1, 0) + aT (1,0) + agT?(1,0) + --- + anT"(1,0) 
= (ao, 0) = (0, a1) 


= (ao, @1). 


Show that in this case, V is cyclic but not simple. 


5. Suppose that M is a finite-dimensional vector space over an algebraically closed field F, 
and in addition M is a module over a ring R containing F’ as a subring. If M is a 
simple R-module and f is an R-module homomorphism, in particular an F-linear 
transformation, on M, show that f is multiplication by some fixed scalar 4 € F’. This 
result is frequently given as a third part of Schur’s lemma. 


6. Let I be a left ideal of the ring R, so that R/I is an R-module but not necessarily a 
ring. Criticize the following statement: “Obviously”, J annihilates R/T. 


9.3. Simple and Semisimple Rings 


9.3.1 Definitions and Comments 


Since a ring is a module over itself, it is natural to call a ring R semisimple if it is 
semisimple as an R-module. Our aim is to determine, if possible, how semisimple rings 
are assembled from simpler components. A plausible idea is that the components are 
rings that are simple as modules over themselves. But this turns out to be too restrictive, 
since the components would have to be division rings (Section 9.1, Problem 1). 

When we refer to a simple left ideal I of R, we will always mean that J is simple as a 
left R-module. We say that the ring R is simple if R is semisimple and all simple left ideals 
of R are isomorphic. [The definition of simple ring varies in the literature. An advantage 
of our choice (also favored by Lang and Bourbaki) is that we avoid an awkward situation 
in which a ring is simple but not semisimple.] Our goal is to show that the building blocks 
for semisimple rings are rings of matrices over a field, or more generally, over a division 
ring. 

The next two results give some properties of modules over semisimple rings. 


9.3.2 Proposition 


If R is a semisimple ring, then every nonzero R-module M is semisimple. 


Proof. By (4.3.6), M is a quotient of a free R-module F’. Since F is a direct sum of copies 
of R (see (4.3.4)), and R is semisimple by hypothesis, it follows from (9.1.2) that F is 
semisimple. By (9.1.3), M is semisimple. d& 
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9.3.3 Proposition 


Let I be a simple left ideal in the semisimple ring R, and let M be a simple R-module. 
Denote by IM the R-submodule of M consisting of all finite linear combinations )7; rix;, 
r, € I, x; € M. Then either IM = M and I is isomorphic to M, or IM = 0. 


Proof. If IM # 0, then since M is simple, JM = M. Thus for some x € M we have 
Ix #0, and again by simplicity of MW, we have Iz = M. Map I onto M by r > ra, and 
note that the kernel cannot be I because Ix #4 0. Since I is simple, the kernel must be 0, 
sol=M. & 


9.3.4 Beginning the Decomposition 


Let R be a semisimple ring. We regard two simple left ideals of R as equivalent if they are 
isomorphic (as R-modules), and we choose a representative I;, i € T from each equivalence 
class. We define the basic building blocks of R as 


B; = the sum of all left ideals of R that are isomorphic to Jj. 


We have a long list of properties of the B; to establish, and for the sake of economy we 
will just number the statements and omit the words “Lemma” and “Proof” in each case. 
We will also omit the end of proof symbol, except at the very end. 


9.3.5 

Ifi Aj, then B;B; =0. [The product of two left ideals is defined exactly as in (9.3.3).] 
Apply (9.3.3) with I replaced by B; and M by B;. 

9.3.6 


R= ier Bi 
If r € R, then (r) is a left ideal, which by (9.1.2) and (9.1.3) (or (9.3.2)) is a sum of 
simple left ideats. 


9.3.7 


Each B; is a two-sided ideal. 
Using (9.3.5) and (9.3.6) we have 


B,C B,R= B;)_ B; = BB; C RB; C B;. 
J 


9.3.8 


R has only finitely many isomorphism classes of simple left ideals [,,..., ;. 

By (9.3.6), we can write the identity 1 of R as a finite sum of elements e; € B;, i € T. 
Adjusting the notation if necessary, let 1 = ee e;. If r € B; where 7 ¢ {1,...,t}, then 
by (9.3.5), re; = 0 for alli=1,...,t, so r=rl=0. Thus B; =0 for 7 ¢ {1,..., t}. 
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9.3.9 


R= 6'_,B;. Thus 1 has a unique representation as ee e;, with e; € B;. 
By (9.3.6) and (9.3.8), R is the sum of the B;. If b; +--- +b; = 0, with b; € B;, then 


0 = e;(b) +--+ + b¢) = exby +--+ egb; = C:d; = (er +--+ + e4)d; = 10; = 0. 


Therefore the sum is direct. 


9.3.10 


If b; € Bi, then e;b; = b; = bje;. Thus e; is the identity on B; and B; = Re; = e;R. 

The first assertion follows from the computation in (9.3.9), along with a similar com- 
putation with e; multiplying on the right instead of the left. Now B; C Re; because 
b; = bje;, and Re; C B; by (9.3.7) and the fact that e; € B;. The proof that B; = e;R is 
similar. 


9.3.11 


Each B; is a simple ring. 

By the computation in (9.3.7), along with (9.3.10), B; is a ring (with identity e;). 
Let J be a simple left ideal of B;. By (9.3.5) and (9.3.6), RJ = B,J = J, so J is a left 
ideal of R, necessarily simple. Thus J is isomorphic to some J;, and we must have j = 7. 
[Otherwise, J would appear in the sums defining both B; and B;, contradicting (9.3.9).] 
Therefore B; has only one isomorphism class of simple left ideals. Now B; is a sum of 
simple left ideals of R, and a subset of B; that is a left ideal of R must be a left ideal 
of B;. Consequently, B; is semisimple and the result follows. 


9.3.12 


If M is a simple R-module, then M is isomorphic to some J;. Thus there are only finitely 
many isomorphism classes of simple R-modules. In particular, if R is a simple ring, then 
all simple R-modules are isomorphic. 

By (9.3.9), 


t t 


ho Be eT =} 


i=l 


where the J are simple left ideals. Therefore 
t t 
M=RM=S°BM=)S_S{JM: J 2 Ij}. 
i=l i=l 


By (9.3.3), JM =0 or J = M. The former cannot hold for all J, since M 4 0. Thus 
M = TI, for some 7. If R is a simple ring, then there is only one 7, and the result follows. 
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9.3.13 


Let M be a nonzero R-module, so that M is semisimple by (9.3.2). Define M; as the sum of 
all simple submodules of M that are isomorphic to [;, so that by (9.3.12), M = ae M;. 
Then 


t 
M = @B,M and B;M e;M = M,;, i=1,...,t. 


4=1 
By definition of B;, 
BM; = “{JMj: J = Li} 


where the J’s are simple left ideals. If N is any simple module involved in the definition 
of M,;, then JN is 0 or N, and by (9.3.3), JN = N implies that N = J = J;. But all 
such N are isomorphic to J;, and therefore B;M; = 0,i 4 j. Thus 


M; = RM; = S_ ByM; = BiM; 
9. 


and 

BM =)_ BiM; = BiM,. 

Hi) 

Consequently, M; = B;M = e;RM = e;M (using (9.3.10)), and all that remains is to 
show that the sum of the M; is direct. Let 7; +---+ a, =0, x; € M;. Then 

O = e(a. +--+ + 24) = E12; 
since e;z; € B;M; = 0 fori # j. Finally, by (9.3.9), 

ea, = (C1 +--+ + ex) ay = 2}. 


9.3.14 


A semisimple ring R is ring-isomorphic to a direct product of simple rings. 
This follows from (9.3.9) and (9.3.5). For if a;,6; € Bi, then 


(ay +--+ +ay)(br +--+ +2) =aibi +--+ arb. 


Problems For Section 9.3 


In Problems 1 and 2, let MZ be a semisimple module, so that M is the direct sum of simple 
modules M;, 7 € I. We are going to show that M is a finite direct sum of simple modules 
if and only if M is finitely generated. 


1. Suppose that x1,...,2%, generate M. It will follow that M is the direct sum of finitely 
many of the M;. How would you determine which M,’s are involved? 
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2. Conversely, assume that M is a finite sum of simple modules. Show that M is finitely 
generated. 

3. A left ideal J is said to be minimal if I #4 0 and I has no proper subideal except 0. 
Show that the ring R is semisimple if and only if R is a direct sum of minimal left 
ideals. 


4. Is Z semisimple? 
5. Is Zp, semisimple? 


6. Suppose that FR is a ring with the property that every nonzero R-module is semi- 
simple. Show that every R-module M is projective, that is, every exact sequence 
0—- A— B— M — 0 splits. Moreover, M is injective, that is, every exact sequence 
0 M A B 0 splits. [Projective and injective modules will be studied in 
Chapter 10.] 


7. For any ring R, show that the following conditions are equivalent. 


(a) 
(b) 
(c) 

) 


(d) Every R-module is injective. 


R is semisimple; 
Every nonzero R-module is semisimple; 


Every R-module is projective; 


9.4 Further Properties of Simple Rings, Matrix Rings, 
and Endomorphisms 


To reach the Wedderburn structure theorem, we must look at simple rings in more detail, 
and supplement what we already know about matrix rings and rings of endomorphisms. 


9.4.1 Lemma 


Let R be any ring, regarded as a left module over itself. If h: R — M is an R-module 
homomorpbism, then for some z € M we have h(r) = rx for every r € R. Moreover, we 
may choose 7 = A(1), and the map h — hA(1) is an isomorphism of Homr(R, M) and M. 
This applies in particular when M = R, in which case h € Endpr(R). 


Proof. The point is that h is determined by what it does to the identity. Thus 

h(r) = h(rl) = rh(1) 
so we may take x = A(1). If s € R and h € Homp,(R, M), we take (sh)(r) = h(rs) = rsx. 
This makes Hompy(R, M) into a left R-module isomorphic to M. (For further discussion 


of this idea, see the exercises in Section 10.7.) @& 


Notice that although all modules are left R-modules, h is given by multiplication on 
the right by x. 
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9.4.2 Corollary 


Let IJ and J be simple left ideals of the simple ring R. Then for some z € R we have 
J=Iz. 


Proof. By the definition of a simple ring (see (9.3.1)), R is semisimple, so by (9.1.2), 
R=I@®L for some left ideal L. Again by the definition of a simple ring, J and J are 
isomorphic (as R-modules). If 7: J — J is an isomorphism and 7 is the natural projection 
of R on I, then tm € Endpr(R), so by (9.4.1), there exists x € R such that rx(r) = ra for 
every r € R. Allow r to range over J to conclude that J=Ix. dh 


A semisimple ring can be expressed as a direct sum of simple left ideals, by (9.1.2). If 
the ring is simple, only finitely many simple left ideals are needed. 


9.4.3 Lemma 
A simple ring R is a finite direct sum of simple left ideals. 


Proof. Let R= @;1; where the J; are simple left ideals. Changing notation if necessary, 
we have 1 = y, +--+ + Ym with y; € lj, j =1,...,m. If x € R, then 


m m 
r=al= So xy; € yp 
j=l j=l 


Therefore RF is a finite sum of the J;, and the sum is direct because the original decom- 
position of R is direct. & 


9.4.4 Corollary 
If I is a simple left ideal of the simple ring R, then TR = R. 


Proof. If J is any simple left ideal of R, then by (9.4.2), J C IR. By (9.4.3), R is a finite 
(direct) sum of simple left ideals, so R C IR. The reverse inclusion always holds, and the 
result follows. @& 


We now have some insight into the structure of simple rings. 


9.4.5 Proposition 
If R is a simple ring, then the only two-sided ideals of R are 0 and R. 


Proof. Let J be a nonzero 2-sided ideal of R. By (9.1.3), J is a semisimple left R-module, 
so by (9.1.2), J is a sum of simple left ideals of J, hence of R. In particular, J contains a 
simple left ideal J. Since J is a right ideal, it follows that J = JR. Using (9.4.4), we have 


J=JROIR=R 


soJ=R. & 
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In the literature, a simple ring is often defined as a ring R whose only two-sided ideals 
are 0 and R, but then extra hypotheses must be added to force R to be semisimple. See 
the exercises for further discussion. 


9.4.6 Corollary 


Let J be a simple left ideal of the simple ring R, and let M be a simple R-module. Then 
IM = M and M is faithful. 


Proof. The first assertion follows from a computation that uses associativity of scalar 
multiplication in a module, along with (9.4.4): 


M = RM =(IR)M =1(RM) =IM. (1) 


Now let b belong to the annihilator of M, so that bM = 0. We must show that b =0. By 
a computation similar to (1) (using in addition the associativity of ring multiplication), 


RbRM = RbM = RO =0. (2) 


But ROR is a two-sided ideal of R (see (2.2.7)), so by (9.4.5), RDR = 0 or R. In the latter 
case, M = RM = RbRM = 0 by (2), contradicting the assumption that M is simple. 
Therefore ROR = 0, in particular, b=1b1=0. & 


We are now ready to show that a simple ring is isomorphic to a ring of matrices. Let 
R be a simple ring, and V a simple R-module. [V exists because R is a sum of simple 
left ideals, and V is unique up to isomorphism by (9.3.12).] Let D = Endr(V), a division 
ring by Schur’s lemma (9.2.1(b)). Then (see (9.2.2)), V is a D-module, in other words, 
a vector space over D. V is a faithful R-module by (9.4.6), and if we can prove that V 
is finite-dimensional as a vector space over D, then by (9.2.5), R is ring-isomorphic to 
Endp(V). If n is the dimension of V over D, then by (4.4.1), Endp(V) & M,,(D°), the 
ring of n by n matrices with entries in the opposite ring D°. 


9.4.7 Theorem 


Let R be a simple ring, V a simple R-module, and D the endomorphism ring Endr(V). 
Then V is a finite-dimensional vector space over D. If the dimension of this vector space 
is n, then (by the above discussion), 


R®& Endp(V) & M,(D°). 


Proof. Assume that we have infinitely many linearly independent elements 21, 2%2,.... 
Let I,, be the left ideal {r € R: ra; =0 for alli = 1,...,m}. Then the I, decrease as m 
increases, in fact they decrease strictly. [Given any m, let f be a D-linear transformation 
on V such that f(x;) =0 for 1 <i< mand f(am+41) # 0. By Jacobson’s theorem (9.2.4), 
there exists r € R such that f(x;) =raj,i=1,...,m+1. But then rz] =---=ra, =0, 
rim41 # 0, so r € Im \ Im4y1.] Write Im = Jm ® Im4i, as in (9.1.2) part (c). [Recall 
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from (9.1.3) that since R is semisimple, so are all left ideals.] Iterating this process, we 
construct a left ideal Jj © Jo ®---, and again by (9.1.2(c)), 


R=)O418d.08°-::. 


Therefore 1 is a finite sum of elements y; € Jj, 1 =0,1,...,¢. But then 


R=HOEAG::- OF 


and it follows that J;,; must be 0, a contradiction. & 


Problems For Section 9.4 


Problems 1-5 are the key steps in showing that a ring R is simple if and only if R is 
Artinian and has no two-sided ideals except 0 and R. Thus if a simple ring is defined as 
one with no nontrivial two-sided ideals, then the addition of the Artinian condition gives 
our definition of simple ring; in particular, it forces the ring to be semisimple. The result 
that an Artinian ring with no nontrivial two-sided ideals is isomorphic to a matrix ring 
over a division ring (Theorem 9.4.7) is sometimes called the Wedderburn-Artin theorem. 


In Problems 1—5, “simple” will always mean simple in our sense. 


. By (9.4.5), a simple ring has no nontrivial two-sided ideals. Show that a simple ring 


must be Artinian. 


. If Ris an Artinian ring, show that there exists a simple R-module. 


. Let R be an Artinian ring with no nontrivial two-sided ideals. Show that R has a 


faithful, simple R-module. 


. Continuing Problem 3, if V is a faithful, simple R-module, and D = Endr(V), show 


that V is a finite-dimensional vector space over D. 


. Continuing Problem 4, show that R is ring-isomorphic to Endp(V), and therefore to 


a matrix ring M,,(D°) over a division ring. 


In the next section, we will prove that a matrix ring over a division ring is simple; this 


concludes the proof that R is simple iff R is Artinian with no nontrivial two-sided ideals. 
(In the “if” part, semisimplicity of R follows from basic properties of matrix rings; see 
Section 2.2, Problems 2, 3 and 4.) 


6. 


If an R-module M is a direct sum 67_, M; of finitely many simple modules, show that 
M has a composition series. (Equivalently, by (7.5.12), M is Artinian and Noetherian.) 


. Conversely, if M/ is semisimple and has a composition series, show that M is a finite 


direct sum of simple modules. (Equivalently, by Section 9.3, Problems 1 and 2, M is 
finitely generated.) 


9.5 The Structure of Semisimple Rings 


We have now done all the work needed for the fundamental theorem. 
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9.5.1 Wedderburn Structure Theorem 


Let R be a semisimple ring. 


(1) R is ring-isomorphic to a direct product of simple rings B,,..., By. 


(2) There are t isomorphism classes of simple R-modules. If V,,...,V; are representatives 
of these classes, let D; be the division ring Endr(V;). Then V; is a finite-dimensional 
vector space over D;. If n; is the dimension of this vector space, then there is a ring 
isomorphism 


B= End p, (Vi) = Mn, (D?). 


Consequently, R is isomorphic to the direct product of matrix rings over division 
rings. Moreover, 


(3) BiV; =0,i #5; BiV; = Vi. 


Proof. Assertion (1) follows from (9.3.5), (9.3.9) and (9.3.14). By (9.3.8) and (9.3.12), 
there are ¢ isomorphism classes of simple R-modules. The remaining statements of (2) 
follow from (9.4.7). The assertions of (3) follow from (9.3.13) and its proof. & 


Thus a semisimple ring can always be assembled from matrix rings over division rings. 
We now show that such matrix rings can never combine to produce a ring that is not 
semisimple. 


9.5.2 Theorem 


The ring M,,(R) of all n by n matrices with entries in the division ring R is simple. 


Proof. We have done most of the work in the exercises for Section 2.2. Let Cy be the 
set of matrices whose entries are 0 except perhaps in column k, k = 1...,n. Then Cy 
is a left ideal of M,,(R), and if any nonzero matrix in Cy belongs to a left ideal J, then 
C, CI. (Section 2.2, Problems 2, 3, 4.) Thus each C; is a simple left ideal, and M,,(R), 
the direct sum of C),...,Cy, is semisimple. 

Now let J be a nonzero simple left ideal. A nonzero matrix in J must have a nonzero 
entry in some column, say column k. Define f: I > Cy by f(A) = Ag, the matrix 
obtained from A by replacing every entry except those in column k by 0. Then f is an 
M,,(R)-module homomorphism, since 


f(BA) = (BA), = BA, = BF (A). 
By construction, f is not identically 0, so by Schur’s lemma, f is an isomorphism. Since 


the Cy are mutually isomorphic, all simple left ideals are isomorphic, proving that M,,(R) 
is simple. @& 
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9.5.3 Informal Introduction to Group Representations 


A major application of semisimple rings and modules occurs in group representation 
theory, and we will try to indicate the connection. Let k be any field, and let G be a finite 
group. We form the group algebra kG, which is a vector space over k with basis vectors 
corresponding to the elements of G. In general, if G = {x1,...,%m}, the elements of kG 
are of the form a ,x%1 + ---+ QmXZm, where the a; belong to k. Multiplication in kG is 
defined in the natural way; we set 


(ax;)(Gxj) = Bayz; 


and extend by linearity. Then kG is a ring (with identity 1,1¢) that is also a vector space 
over k, and a(ay) = (ax)y = r(ay), a € k, x,y € G, so kG is indeed an algebra over 
k. [This construction can be carried out with an arbitrary ring R in place of k, and with 
an arbitrary (not necessarily finite) group G. The result is the group ring RG, a free 
R-module with basis G.] 

Now let V be an n-dimensional vector space over k. We want to describe the situation 
in which “G acts linearly on V”. We are familiar with group action (Section 5.1), but we 
now add the condition that each g € G determines a linear transformation p(g) on V. We 
will write p(g)(v) as simply gv or g(v), so that g(av + Bw) = ag(v) + Bg(w). Thus we 
can multiply vectors in V by scalars in G. Since elements of kG are linear combinations 
of elements of G with coefficients in k, we can multiply vectors in V by scalars in kG. To 
summarize very compactly, 


V is a kG-module. 


Now since G acts on V, (hg)v = h(gv) and lev = v, gh € G,v € V. Thus p(hg) = 
p(h)p(g), and each p(g) is invertible since p(g)p(g~') = p(1c) = the identity on V. 
Therefore 


p is a homomorphism from G to GL(V), 


the group of invertible linear transformations on V. Multiplication in GL(V) corresponds 
to composition of functions. 


The homomorphism p is called a representation of G in V, 


and n, the dimension of V, is called the degree of the representation. If we like, we can 
replace GL(V) by the group of all nonsingular n by n matrices with entries in k. In this 
case, p is called a matrix representation. 

The above process can be reversed. Given a representation p, we can define a linear 
action of G on V by gu = p(g)(v), and thereby make V a kG-module. Thus representations 
can be identified with kG-modules. 


9.5.4 The Regular Representation 


If G has order n, then kG is an n-dimensional vector space over k with basis G. We take V 
to be kG itself, with gu the product of g and v in kG. As an example, let G = {e,a,a7}, 
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a cyclic group of order 3. V is a 3-dimensional vector space with basis e,a,a?, and the 
action of G on V is determined by 
ee =e, ea=a, ea” = a’; 


ae =a, aa =a", aa’? =e; 


a’e =a’, a2a=e, a?a? =a. 


Thus the matrices p(g) associated with the elements g € G are 
1 0 0 0 0 1 0 1 0 
fe] = |0 1 O}, fa] =]1 O Of], [e7]=]0 0 1 
0 0 1 0 1 0 1 0 0 


9.5.5 The Role of Semisimplicity 


Suppose that p is a representation of G in V. Assume that the basis vectors of V can 
be decomposed into two subsets v(A) and v(B) such that matrix of every g € G has the 
form 

fais A 0 

lo Bi- 


(The elements of A and B will depend on the particular g, but the dimensions of A and B 
do not change.) The corresponding statement about V is that 


V=Va0VB 


where Vy and Vg are kG-submodules of V. We can study the representation by analyzing 
its behavior on the simpler spaces V4 and Vg. Maschke’s theorem, to be proved in the 
next section, says that under wide conditions on the field k, this decomposition process 
can be continued until we reach subspaces that have no nontrivial kG-submodules. In 
other words, every kG-module is semisimple. In particular, kG is a semisimple ring, 
and the Wedderburn structure theorem can be applied to get basic information about 
representations. 

We will need some properties of projection operators, and it is convenient to take care 
of this now. 


9.5.6 Definitions and Comments 


A linear transformation 7 on a vector space V [or more generally, a module homomor- 

phism] is called a projection of V (on 7(V)) if m is idempotent, that is, 7? = 7. We 

have already met the natural projection of a direct sum onto a component, but there 

are other possibilities. For example, let p be the projection of R? = R@R given by 

p(z,y) = (4, —$**). Note that 7 must be the identity on 7(V), since t(m(v)) = 7(v). 
If we choose subspaces carefully, we can regard any projection as natural. 
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9.5.7 Proposition 


If 7 is a projection on V, then V is the direct sum of the image of 7 and the kernel of 7. 


Proof. Since v = m(v) + (v — m(v)) and a(v — z(v)) = 0, V = imV + kerV. To show 
that the sum is direct, let v = m(w) € kerma. Then 0 = r(v) = 7?(w) = a(w) = v, so 
imaOkerr=0. & 


9.5.8 Example 


For real numbers x and y, we have (#,y) = (x — cy)(1,0) + y(c,1), where c is any fixed 
real number. Thus R? = R(1,0) @ R(c, 1), and if we take p(x, y) = (a —cy,0), then p isa 
projection of R? onto R(1,0). By varying ¢ we can change the complementary subspace 
R(c, 1). Thus we have many distinct projections onto the same subspace R(1, 0). 


Problems For Section 9.5 


1. Show that the regular representation is faithful, that is, the homomorphism p is injec- 
tive. 


2. Let G be a subgroup of S,, and let V be an n-dimensional vector space over k with 
basis v(1),...,v(n). Define the action of G on V by 


g(v(t)) = v(g(t)), t= 1,...,n. 


Show that the action is legal. (V is called a permutation module.) 

3. Continuing Problem 2, if n = 4, find the matrix of g = (1, 4,3). 

4. Here is an example of how a representation can arise in practice. Place an equilateral 
triangle in the plane V, with the vertices at v1 = (1,0), v2 = (—4,4 3) and v3 = 
(—4,-$¥V3); note that v; + v2 + v3 = 0. Let G = Dg be the group of symmetries 
of the triangle, with g = counterclockwise rotation by 120 degrees and h = reflection 
about the horizontal axis. Each member of Dg is of the form g'hJ, i = 0,1,2, 7 = 0,1, 
and induces a linear transformation on V. Thus we have a representation of G in V 
(the underlying field k can be taken as R). 


With v; and v2 taken as a basis for V, find the matrices [g] and [h] associated with g 
and h. 


5. Continue from Problem 4, and switch to the standard basis e; = v1 = (1,0), eg = 
(0,1). Changing the basis produces an equivalent matrix representation. The matrix 
representing the element a € G is now of the form 


[a] = P~'[a]P 


where the similarity matrix P is the same for every a € G (the key point). 


Find the matrix P corresponding to the switch from {v1,v2} to {e1,e2}, and the 
matrices [g]’ and [A]. 
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6. Consider the dihedral group Dg, generated by elements R (rotation) and F (reflection). 
We assign to R the 2 by 2 matrix 


and to F' the 2 by 2 matrix 


1 0 
o-[f 
Show that the above assignment determines a matrix representation of Dg of degree 2. 


7. Is the representation of Problem 6 faithful? 


A very accessible basic text on group representation theory is “Representations and 
Characters of Groups” by James and Liebeck. 


9.6 Maschke’s Theorem 


We can now prove the fundamental theorem on decomposition of representations. It is 
useful to isolate the key ideas in preliminary lemmas. 


9.6.1 Lemma 


Let G be a finite group, and k a field whose characteristic does not divide |G| (so that 
division by |G| is legal). Let V be a kG-module, and w a linear transformation on V as 
a vector space over k. Define 0: V — V by 


Ov) = ay val. 


gEG 


Then not only is 6 a linear transformation on the vector space V, but it is also a kG- 
homomorphism. 


Proof. Since ~ is a linear transformation and G acts linearly on V (see (9.5.3)), 6 is linear. 
Now if h € G, then 


As g ranges over all of G, so does gh. Thus we can let « = gh, g~' = ha—!, to obtain 


O(hv) = a i) = hO(v) 


and the result follows. & 
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9.6.2 Lemma 


In (9.6.1), suppose that ~ is a projection of V on a subspace W that is also a kG-submodule 
of V. Then @ is also a projection of V on W. 


Proof. If v € W, then g(v) € W since W is a kG-submodule of V. Thus wg(v) = g(v) 
since 7 is a projection on W. By definition of @ we have 6(v) = v. To prove that 6? = 6, 
note that since ~ maps V into the kG-submodule W,, it follows from the definition of 6 
that 6 also maps V into W. But @ is the identity on W, so 


and @ is a projection. Since 6 maps into W and is the identity on W, @ is a projection of 
VonW. & 


9.6.3. Maschke’s Theorem 


Let G be a finite group, and k a field whose characteristic does not divide |G|. If V isa 
kG-module, then V is semisimple. 


Proof. Let W be a kG-submodule of V. Ignoring the group algebra for a moment, we 
can write V = W @U as vector spaces over k. Let w be the natural projection of V 
on W, and define @ as in (9.6.1). By (9.6.1) and (9.6.2), 6 is a kG-homomorphism and 
also a projection of V on W. By (9.5.7), V = im@ @ ker@ = W @ ker@ as kG-modules. 
By (9.1.2), V is semisimple. @& 


We have been examining the decomposition of a semisimple module into a direct sum 
of simple modules. Suppose we start with an arbitrary module M, and ask whether M 
can be expressed as M, ®@ M2, where M, and M2 are nonzero submodules. If so, we can 
try to decompose M; and Ms, and so on. This process will often terminate in a finite 
number of steps. 


9.6.4 Definition 


The module M is decomposable if M = M, @ Moz, where My, and Mo are nonzero sub- 
modules. Otherwise, M is indecomposable. 


9.6.5 Proposition 


Let M be a module with a composition series; equivalently, by (7.5.12), M is Noetherian 
and Artinian. Then M can be expressed as a finite direct sum 67_, M; of indecomposable 
submodules. 


Proof. If the decomposition process does not terminate, infinite ascending and descending 
chains are produced, contradicting the hypothesis. d& 
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As the above argument shows, the hypothesis can be weakened to M Noetherian or 
Artinian. But (9.6.5) is usually stated along with a uniqueness assertion which uses the 
stronger hypothesis: 

If M has a composition series and M = Oj_,Mi = O7,N;, where the M; and N; 
are indecomposable submodules, then n = m and the M; are, up to isomorphism, just a 
rearrangement of the Nj. 

The full result (existence plus uniqueness) is most often known as the Krull-Schmidt 
Theorem. [One or more of the names Remak, Azumaya and Wedderburn are sometimes 
added.] The uniqueness proof is quite long (see, for example, Jacobson’s Basic Algebra II), 
and we will not need the result. 

Returning to semisimple rings, there is an asymmetry in the definition in that a ring is 
regarded as a left module over itself, so that submodules are left ideals. We can repeat the 
entire discussion using right ideals, so that we should distinguish between left-semisimple 
and right-semisimple rings. However, this turns out to be unnecessary. 


9.6.6 Theorem 
A ring R is left-semisimple if and only if it is right-semisimple. 


Proof. If R is left-semisimple, then by (9.5.1), R is isomorphic to a direct product of 
matrix rings over division rings. But a matrix ring over a division ring is right-simple 
by (9.5.2) with left ideals replaced by right ideals. Therefore R is right-semisimple. The 
reverse implication is symmetrical. d& 


Problems For Section 9.6 


1. Let V be the permutation module for G = S3 (see Section 9.5, Problem 2), with basis 
V1, V2, U3. Give an example of a nontrivial kG-submodule of V. 


In Problems 2—4, we show that Maschke’s theorem can fail if the characteristic of k 
divides the order of G. Let G = {1,a,...,a?~'} be a cyclic group of prime order p, and 
let V be a two-dimensional vector space over the field F,, with basis vj,v2. Take the 
matrix of a as 


Pp? 


so that 


and [a] is the identity. 

2. Show that W, the one-dimensional subspace spanned by vj, is a kG-submodule of V. 
3. Continuing Problem 2, show that W is the only one-dimensional kG-submodule of V. 
4. Continuing Problem 3, show that V is not a semisimple kG-module. 
5 


. Show that a semisimple module is Noetherian iff it is Artinian. 
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6. Let M be adecomposable R-module, so that M is the direct sum of nonzero submodules 
M, and M2. Show that Endp(M) contains a nontrivial idempotent e (that is, e? = e 
with e not the zero map and not the identity). 


7. Continuing from Problem 6, suppose conversely that Endr(M) contains a nontrivial 
idempotent e. Show that M is decomposable. (Suggestion: use e to construct idem- 
potents e; and eg that are orthogonal, that is, e,e2 = e2e1 = 0.) 


9.7 The Jacobson Radical 


There is a very useful device that will allow us to look deeper into the structure of rings. 


9.7.1 Definitions and Comments 


The Jacobson radical J(R) of a ring R is the intersection of all maximal left ideals of R. 
More generally, the Jacobson radical J(M) = Jr(M) of an R-module M is the intersection 
of all maximal submodules of M. [“Maximal submodule” will always mean “maximal 
proper submodule”.] If M has no maximal submodule, take J(M) = M. 

If M is finitely generated, then every submodule N of M is contained in a maximal 
submodule, by Zorn’s lemma. [If the union of a chain of proper submodules is M, then 
the union contains all the generators, hence some member of the chain contains all the 
generators, a contradiction.] Taking N = 0, we see that J(/) is a proper submodule 
of M. Since R is finitely generated (by lr), J(R) is always a proper left ideal. 

Semisimplicity of M imposes a severe constraint on J(M). 


9.7.2 Proposition 


If M is semisimple, then J(M) = 0. Thus in a sense, the Jacobson radical is an “obstruc- 
tion” to semisimplicity. 


Proof. Let N be any simple submodule of M. By (9.1.2), M = N @ N’ for some sub- 
module N’. Now M/N’ & N, which is simple, so by the correspondence theorem, N’ is 
maximal. Thus J(M) C N’, and therefore J(M/)™ N = 0. Since M is a sum of simple 
modules (see (9.1.2)), J(M)=J(M)NM=0 & 


Here is another description of the Jacobson radical. 


9.7.3. Proposition 


J(R) is the intersection of all annihilators of simple R-modules. 


Proof. By Section 9.1, Problem 3, simple modules are isomorphic to R/J for maximal 
left ideals J. If r annihilates all simple R-modules, then for every maximal left ideal J, 
r annihilates R/J, in particular, r annihilates 1+ 7. Thus r(1 + J) = J, that is, r € I. 
Consequently, r € J(R). 

Conversely, assume r € J(R). If M is a simple R-module, choose any nonzero element 
x € M. The map f,: R — M given by f,(s) = sx is an epimorphism by simplicity of 
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M. The kernel of f, is the annihilator of x, denoted by ann(). By the first isomorphism 
theorem, M = R/ann(x). By simplicity of M, ann(x) is a maximal left ideal, so by 
hypothesis, r € Nee ann(x) = ann(M). Thus r annihilates all simple R-modules. & 


9.7.4 Corollary 
J(R) is a two-sided ideal. 


Proof. We noted in (4.2.6) that ann(M) is a two-sided ideal, and the result follows 
from (9.7.3). @& 


In view of (9.7.4), one might suspect that the Jacobson radical is unchanged if right 
rather than left ideals are used in the definition. This turns out to be the case. 


9.7.5 Definitions and Comments 


The element a € R is left quasi-regular (lqr) if 1 — a has a left inverse, right quasi-regular 
(rqr) if 1 — a has a right inverse, and quasi-regular (qr) if 1 — a is invertible. Note that 
if a is both lqr and rqr, it is qr, because if b(1 — a) = (1 — a)c = 1, then 


b=b1=b(1-a)c=I1c=c. 


9.7.6 Lemma 


Let I be a left ideal of R. If every element of J is lqr, then every element of I is qr. 


Proof. If a € I, then we have b(1 — a) = 1 for some b € R. Let c = 1 — b, so that 
(l—c)41-—a) =1-—a—c+ca=1. Thus c=ca—a=(c—1)a€é I. By hypothesis, c is 
lIqr, so 1 —c has a left inverse. But we know that (1 —c) has a right inverse (1 — a) [see 
above], so cis rqr. By (9.7.5), c is qr and 1 — c is the two-sided inverse of l—a. & 


9.7.7 Proposition 


The Jacobson radical J(R) is the largest two-sided ideal consisting entirely of quasi-regular 
elements. 


Proof. First, we show that each a € J(R) is lqr, so by (9.7.6), each a € J(R) is qr. If l—a 
has no left inverse, then R(1 — a) is a proper left ideal, which is contained in a maximal 
left ideal I (as in (2.4.2) or (9.7.1)). But then a € J and 1—a€ J, and therefore 1 € J,a 
contradiction. 

Now we show that every left ideal (hence every two-sided ideal) J consisting entirely 
of quasi-regular elements is contained in J(R). If a € I but a ¢ J(R), then for some 
maximal left ideal L we have a ¢ L. By maximality of L, we have 1+ L = R,sol=b+c 
for some b € IJ, c € L. But then 0 is quasi-regular, so c = 1 — 6 has an inverse, and 
consequently 1 € L, a contradiction. d& 
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9.7.8 Corollary 


J(R) is the intersection of all maximal right ideals of R. 


Proof. We can reproduce the entire discussion beginning with (9.7.1) with left and right 
ideals interchanged, and reach exactly the same conclusion, namely that the “right” 
Jacobson radical is the largest two-sided ideal consisting entirely of quasi-regular ele- 
ments. It follows that the “left”and “right” Jacobson radicals are identical. d& 


We can now use the Jacobson radical to sharpen our understanding of semisimple 
modules and rings. 


9.7.9 Theorem 


If M is a nonzero R-module, the following conditions are equivalent: 


(1) M is semisimple and has finite length, that is, has a composition series; 
(2) M is Artinian and J(M) =0. 


Proof. (1) implies (2) by (7.5.12) and (9.7.2), so assume M Artinian with J(M) = 0. 
The Artinian condition implies that the collection of all finite intersections of maximal 
submodules of M has a minimal element N. If S is any maximal submodule of M, then 
NOS is a finite intersection of maximal submodules, so by minimality of N, NOS =N, 
so N C S. Since J(M) is the intersection of all such $, the hypothesis that J(M) = 0 
implies that N = 0. Thus for some positive integer n we have maximal submodules 
M,,...,M, such that N7_,M; = 0. 

Now M is isomorphic to a submodule of M’ = @7_,(M/M;). To see this, map x € M 
to (a+M,,...,2+M,,) and use the first isomorphism theorem. Since M’ is a finite direct 
sum of simple modules, it is semisimple and has a composition series. (See Section 9.4, 
Problem 6.) By (9.1.3) and (7.5.7), the same is true for M. & 


9.7.10 Corollary 
The ring R is semisimple if and only if R is Artinian and J(R) = 0. 


Proof. By (9.7.9), it suffices to show that if R is semisimple, then it has a composition 
series. But this follows because R is finitely generated, hence is a finite direct sum of 
simple modules (see Section 9.3, Problem 1). & 


The Jacobson radical of an Artinian ring has some special properties. 


9.7.11 Definitions and Comments 


An ideal (or left ideal or right ideal) I of the ring R is nil if each element x € I is nilpotent, 
that is, «” = 0 for some positive integer m; I is nilpotent if I” = 0 for some positive 
integer n. Every nilpotent ideal is nil, and the converse holds if R is Artinian, as we will 
prove. 
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9.7.12 Lemma 
If I is a nil left ideal of R, then J C J(R). 


Proof. If « € I and x” = 0, then z is quasi-regular; the inverse of 1 — x is 1+a+a?+ 
--»+a'~!. The result follows from the proof of (9.7.7). & 


9.7.13 Proposition 


If Ris Artinian, then J(R) is nilpotent. Thus by (9.7.11) and (9.7.12), J(R) is the largest 
nilpotent ideal of R, and every nil ideal of R is nilpotent. 


Proof. Let J = J(R). The sequence J D J? D --- stabilizes, so for some n we have 
J” = Jrtl —..., in particular, J” = J?”. We claim that J” = 0. If not, then the 
collection of all left ideals Q of R such that J”Q 4 0 is nonempty (it contains J”), hence 
has a minimal element N. Choose « € N such that J°x 4 0. By minimality of N, 
J”°x = N. Thus there is an element c € J” such that cx = x, that is, (1—c)x = 0. But 
ce J” CJ, so by (9.7.7), 1—c is invertible, and consequently x = 0, a contradiction. & 


Problems For Section 9.7 
1. Show that an R-module is M cyclic if and only if M is isomorphic to R/I for some 
left ideal J, and in this case we can take J to be ann(M), the annihilator of MW. 


2. Show that the Jacobson radical of an R-module M is the intersection of all kernels of 
homomorphisms from M to simple R-modules. 


3. If [ = J(R), show that J(R/I) = 0. 
4. If f is an R-module homomorphism from M to N, show that f(J(M)) C J(N). 


5. Assume R commutative, so that J(R) is the intersection of all maximal ideals of R. If 
a € R, show that a € J(R) if and only if 1+ ab is a unit for every b € R. 


6. If N is a submodule of the Jacobson radical of the R-module M, show that J(M)/N = 
J(M/N). 


9.8 Theorems of Hopkins-Levitzki and Nakayama 


From Section 7.5, we know that a Noetherian ring need not be Artinian, and an Artinian 
module need not be Noetherian. But the latter situation can never arise for rings, because 
of the following result. 


9.8.1 Theorem (Hopkins and Levitzki) 


Let R be an Artinian ring, and M a finitely generated R-module. Then M is both Artinian 
and Noetherian. In particular, with M = R, an Artinian ring is Noetherian. 
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Proof. By (7.5.9), M is Artinian. Let J be the Jacobson radical of R. By Section 9.7, 
Problem 3, the Jacobson radical of R/J is zero, and since R/J is Artinian by (7.5.7), it 
is semisimple by (9.7.9). Now consider the sequence 


Mo=M, Mi = JM, Mz = J’M,.... 


By (9.7.13), J is nilpotent, so M,, = 0 for some n. Since JM; = Mj41, J annihilates 
M;/Mi+41, so by Section 4.2, Problem 6, M;/Mj+1 is an R/J-module. 


We claim that each M;/Mj,1 has a composition series. 


We can assume that M;/M;,1 4 0, otherwise there is nothing to prove. By (9.3.2), 
M;/Mj+1 is semisimple, and by (7.5.7), M;/Mi41 is Artinian. [Note that submodules of 
M;/Mj+1 are the same, whether we use scalars from R or from R/J; see Section 4.2, 
Problem 6.] By Section 9.6, Problem 5, M;/Mj41 is Noetherian, hence has a composition 
series by (7.5.12). Now intuitively, we can combine the composition series for the M;/Mj+1 
to produce a composition series for M, proving that M is Noetherian. Formally, M,_1 = 
M,,-1/Mn has a composition series. Since M,-2/M,~-1 has a composition series, so does 
My ~2, by (7.5.7). Iterate this process until we reach M. & 


We now proceed to a result that has many applications in both commutative and 
noncommutative algebra. 


9.8.2 Nakayama’s Lemma, Version 1 


Let M be a finitely generated R-module, and I a two-sided ideal of R. If J C J(R) and 
IM = M, then M =0. 


Proof. Assume M # 0, and let 271,...,2%n generate M, where n is as small as possible. 
(Then n > 1 and the 2; are nonzero.) Since x, € M = IM, we can write tp = 07", biy; 
for some 6; € J and y; © M. But y; can be expressed in terms of the generators as 
Y= ae aj;v; with a,; € R. Thus 


n 
Ln = ) diag Xs = ) CiX5 
ij j=l 


where c; = )7j", bjaij. Since I is a right ideal, c; € I C J(R). (We need I to be a left 
ideal to make IM a legal submodule of M.) The above equation can be written as 


and by (9.7.7), 1 — cy is invertible. If nm > 1, then x, is a linear combination of the other 
x;’s, contradicting the minimality of n. Thus n = 1, in which case (1 — c))a1 = 0, so 
x, =0, again a contradiction. d& 


There is another version of Nakayama’s lemma, which we prove after a preliminary 
result. 
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9.8.3 Lemma 


Let N be a submodule of the R-module M, I a left ideal of R. Then M = N+JM if and 
only if M/N = I(M/N). 


Proof. Assume M = N + IM, and let 1+ N € M/N. Then « = y+ z for some y € N 
and z€ IM. Write z= ees a;w;, a; € I, w; © M. It follows that 


x+N=ays(wi + N)+---+ae(wz+ N) € I(M/N). 


Conversely, assume M/N = I(M/N), and let x € M. Then 


t 
x+N =) -a(wit N) 


i=1 


with a; € J and w; € M. Consequently, x — eee ajw;€ N,sorEeN+IM. & 


9.8.4 Nakayama’s Lemma, Version 2 


Let N be a submodule of the R-module M, with M/N finitely generated over R. [This 
will be satisfied if MW is finitely generated over R.| If I is a two-sided ideal contained in 
J(R), and M=N+IM, then M=N. 


Proof. By (9.8.3), I(M/N) = M/N, so by (9.8.2), M/N =0, hence M=N. & 


Here is an application of Nakayama’s lemma. 


9.8.5 Proposition 


Let R be a commutative local ring with maximal ideal J (see (8.5.8)). Let M be a finitely 
generated R-module, and let V = M/JM. Then: 


(i) V is a finite-dimensional vector space over the residue field k = R/J. 


(ii) If {a1 + JM,...,¢%n + JM} is a basis for V over k, then {21,...,2%,} is a minimal 
set of generators for M. 


(iii) Any two minimal generating sets for M have the same cardinality. 


Proof. (i) Since J annihilates M/JM, it follows from Section 4.2, Problem 6, that V is 
a k-module, that is, a vector space over k. Since M is finitely generated over R, V is 
finite-dimensional over k. 

(ii) Let N = S07, Ra;. Since the 2; + JM generate V = M/JM, we have 
M=N+4JM. By (9.8.4), M = N, so the x; generate M. If a proper subset of the 2; 
were to generate M, then the corresponding subset of the x; + JM would generate V, 
contradicting the assumption that V is n-dimensional. 

(iii) A generating set S for M with more than n elements determines a spanning 
set for V, which must contain a basis with exactly n elements. By (ii), S cannot be 
minimal. & 
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Problems For Section 9.8 


1. Let a be a nonzero element of the integral domain R. If (a’) = (a***) for some positive 
integer t, show that a is invertible. 


2. Continuing Problem 1, show that every Artinian integral domain is a field. 
3. If R is a commutative Artinian ring, show that every prime ideal of R is maximal. 


4. Let R be a commutative Artinian ring. If S is the collection of all finite intersections 
of maximal ideals of R, then S is not empty, hence contains a minimal element 
L=]N12N---NIn, with the J; maximal. Show that if P is any maximal ideal of R, 
then P must be one of the J;. Thus R has only finitely many maximal ideals. 


5. An R-module is projective if it is a direct summand of a free module. We will study 
projective modules in detail in Section 10.5. We bring up the subject now in Prob- 
lems 5 and 6 to illustrate a nice application of Nakayama’s lemma. 

Let R be a commutative local ring, and let M be a finitely generated projective 
module over R, with a minimal set of generators {x1,...,2%n} (see (9.8.5)). We can 
assume that for some free module F of rank n, 


F=MBON. 
To justify this, let F' be free with basis e,,...,¢,, and map F' onto M via e; > a;, 
i=1,...,n. If the kernel of the mapping is K, then we have a short exact sequence 


(aK Sar ai 30, 


which splits since M is projective. [This detail will be covered in (10.5.3).] 
Let J be the maximal ideal of R, and k = R/J the residue field. Show that 


F/JF %M/JIM@N/JN. 


6. Continue from Problem 5 and show that N/JN = 0. It then follows from Nakayama’s 
lemma (9.8.2) that N = 0, and therefore M = F. We conclude that a finitely 
generated projective module over a commutative local ring is free. 


7. We showed in (9.6.6) that there is no distinction between a left and a right-semisimple 
ring. This is not the case for Noetherian (or Artinian) rings. 


Let X and Y be noncommuting indeterminates, in other words, XY #4 YX, and let 
Z< X,Y > be the set of all polynomials in X and Y with integer coefficients. [Elements 
of Z do commute with the indeterminates.] We impose the relations Y? = 0 and YX = 0 
to produce the ring R; formally, R= Z< X,Y > /(Y?,YX). 

Consider I = Z[X]Y, the set of all polynomials f(X)Y, f(X) € Z[X]. Then I is a 
two-sided ideal of R. Show that if J is viewed as a right ideal, it is not finitely generated. 
Thus R is not right-Noetherian. 


8. Viewed as a left R-module, R = Z[X] ® Z[X]Y. Show that R is left-Noetherian. 
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9. Assume the hypothesis of (9.8.5). If {@1,...,2,} is a minimal generating set for M, 
show that {%1,...,%,}, where 7; = x; + JM, is a basis for M/JM =V. 
10. Continuing Problem 9, suppose that {21,...,2,} and {y1,...,Yn} are minimal gen- 
erating sets for M, with y; = yy, aijgxj, aj € R. If A is the matrix of the a;;, show 
that the determinant of A is a unit in R. 


Chapter - 10 


> Introducing Homological 
Algebra 


Chapter 10 


Introducing Homological 
Algebra 


Roughly speaking, homological algebra consists of (A) that part of algebra that is funda- 
mental in building the foundations of algebraic topology, and (B) areas that arise naturally 
in studying (A). 


10.1 Categories 


We have now encountered many algebraic structures and maps between these structures. 
There are ideas that seem to occur regardless of the particular structure under consider- 
ation. Category theory focuses on principles that are common to all algebraic systems. 


10.1.1 Definitions and Comments 


A category C consists of objects A,B,C,... and morphisms f: A — B (where A and B 

are objects). If f: A + Band g: B — C are morphisms, we have a notion of composition, 

in other words, there is a morphism gf = go f: A > C, such that the following axioms 

are satisfied. 

(i) Associativity: If f: A— B, g: B— C,h: C > D, then (hg) f = higf); 

(ii) Identity: For each object A there is a morphism 14: A — A such that for each 
morphism f: A — B, we have fl4 = 1pf = f. 


A remark for those familiar with set theory: For each pair (A,B) of objects, the 
collection of morphisms f: A — B is required to be a set rather than a proper class. 

We have seen many examples: 
1. Sets: The objects are sets and the morphisms are functions. 
2. Groups: The objects are groups and the morphisms are group homomorphisms. 


3. Rings: The objects are rings and the morphisms are ring homomorphisms. 
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4. Fields: The objects are fields and the morphisms are field homomorphisms [= field 
monomorphisms; see (3.1.2)]. 

5. R-mod: The objects are left R-modules and the morphisms are R-module homomor- 
phisms. If we use right R-modules, the corresponding category is called mod-R. 


6. Top: The objects are topological spaces and the morphisms are continuous maps. 


7. Ab: The objects are abelian groups and the the morphisms are homomorphisms from 
one abelian group to another. 


A morphism f: A — B is said to be an isomorphism if there is an inverse morphism 
g: B — A, that is, gf = 14 and fg = 1g. In Sets, isomorphisms are bijections, and 
in Top, isomorphisms are homeomorphisms. For the other examples, an isomorphism is 
a bijective homomorphism, as usual. 

In the category of sets, a function f is injective iff f(a1) = f(a) implies 21 = x2. But 
in an abstract category, we don’t have any elements to work with; a morphism f: A — B 
can be regarded as simply an arrow from A to B. How do we generalize injectivity to an 
arbitrary category? We must give a definition that does not depend on elements of a set. 
Now in Sets, f is injective iff it has a left inverse; equivalently, f is left cancellable, i.e. 
if fhy = fho, then hy = ho. This is exactly what we need, and a similar idea works for 
surjectivity, since f is surjective iff f is right cancellable, i.e., hi f = haf implies hi = he. 


10.1.2 Definitions and Comments 


A morphism f is said to be monic if it is left cancellable, epic if it is right cancellable. 
In all the categories listed in (10.1.1), a morphism f is monic iff f is injective as a 
mapping of sets. If f is surjective, then it is epic, but the converse can fail. See Problems 2 
and 7-10 for some of the details. 
In the category R-mod, the zero module {0} has the property that for any R- 
module M, there is a unique module homomorphism from M to {0} and a unique module 
homomorphism from {0} to M. Here is a generalization of this idea. 


10.1.3. Definitions and Comments 


Let A be an object in a category. If for every object B, there is a unique morphism from 
A to B, then A is said to be an initial object. If for every object B there is a unique 
morphism from B to A, then A is said to be a terminal object. A zero object is both 
initial and terminal. 

In the category of sets, there is only one initial object, the empty set. The terminal 
objects are singletons {x}, and consequently there are no zero objects. In the category of 
groups, the trivial group consisting of the identity alone is a zero object. We are going 
to prove that any two initial objects are isomorphic, and similarly for terminal objects. 
This will be a good illustration of the duality principle, to be discussed next. 


10.1.4 Duality 


If C is a category, the opposite or dual category C°P has the same objects as C. The 
morphisms are those of C with arrows reversed; thus f: A — B is a morphism of C°P 
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iff f: B — A is a morphism of C. If the composition gf is permissible in C, then fg is 
permissible in C°P. To see how the duality principle works, let us first prove that if A 
and B are initial objects of C, then A and B are isomorphic. There is a unique morphism 
f: A— B and a unique morphism g: B — A. But 14: A— A and lg: B — B, and it 
follows that gf = 1,4 and fg =1g. The point is that we need not give a separate proof 
that any two terminal objects are isomorphic. We have just proved the following: 

If A and B are objects in a category C, and for every object D of C, there is a unique 
morphism from A to D and there is a unique morphism from B to D, then A and B are 
isomorphic. 

Our statement is completely general; it does not involve the properties of any specific 
category. If we go through the entire statement and reverse all the arrows, equivalently, 
if we replace C by C°P, we get: 

If A and B are objects in a category C, and for every object D of C, there is a unique 
morphism from D to A and there is a unique morphism from D to B, then A and B are 
isomorphic. 

In other words, any two terminal objects are isomorphic. If this is unconvincing, just 
go through the previous proof, reverse all the arrows, and interchange fg and gf. We say 
that initial and terminal objects are dual. Similarly, monic and epic morphisms are dual. 

If zero objects exist in a category, then we have zero morphisms as well. If Z is a 
zero object and A and B arbitrary objects, there is a unique f: A — Z and a unique 
g: Z > B. The zero morphism from A to B, denoted by O,4z, is defined as gf, and it 
is independent of the particular zero object chosen (Problem 3). Note that since a zero 
morphism goes through a zero object, it follows that for an arbitrary morphism h, we 
have h0O = 0h = 0. 


10.1.5 Kernels and Cokernels 


If f: A — B is an R-module homomorphism, then its kernel is, as we know, {x € 
A: f(a) = 0}. The cokernel of f is defined as the quotient group B/im(f). Thus f 
is injective iff its kernel is 0, and f is surjective iff its cokernel is 0. We will generalize 
these notions to an arbitrary category that contains zero objects. The following diagram 
indicates the setup for kernels. 


We take C' to be the kernel of the module homomorphism f, with 7 the inclusion map. If 
fg = 0, then the image of g is contained in the kernel of f, so that g actually maps into C. 
Thus there is a unique module homomorphism h: D — C' such that g = ih; simply take 
h(x) = g(a) for all x. The key to the generalization is to think of the kernel as the 
morphism i. This is reasonable because C' and 7 essentially encode the same information. 
Thus a kernel of the morphism f: A — B is a morphism 7: C > A such that: 


(1) fi=0. 
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(2) If g: D — A and fg = 0, then there is a unique morphism h: D — C such that 
g=th. 


Thus any map killed by f can be factored through 7. 
If we reverse all the arrows in the above diagram and change labels for convenience, 
we get an appropriate diagram for cokernels. 


peng eee gs 


NLA 


D 


We take p to be the canonical map of B onto the cokernel of f, so that C = B/im(f). If 
gf =0, then the image of f is contained in the kernel of g, so by the factor theorem, there 
is a unique homomorphism h such that g = hp. In general, a cokernel of a morphism 
f: A— Bisa morphism p: B > C such that: 


(1’) pf =0. 
(2’) If g: B — D and gf = 0, then there is a unique morphism h: C — D such that 
g = hop. 


Thus any map that kills f can be factored through p. 

Since going from kernels to cokernels simply involves reversing arrows, kernels and 
cokernels are dual. Note, however, that in an arbitrary category with 0, kernels and 
cokernels need not exist for arbitrary morphisms. But every monic has a kernel and (by 
duality) every epic has a cokernel; see Problem 5. 


Problems For Section 10.1 


1. Show that in any category, the identity and inverse are unique. 

2. In the category of rings, the inclusion map 7: Z — Q is not surjective. Show, however, 
that 7 is epic. 

3. Show that the zero morphism 0,48, is independent of the particular zero object chosen 
in the definition. 

4. Show that a kernel must be monic (and by duality, a cokernel must be epic). [In 
the definition of kernel, we can assume that 7 is monic in (1) of (10.1.5), and drop 
the uniqueness assumption on h. For 7 monic forces uniqueness of h, by definition of 
monic. Conversely, uniqueness of h forces i to be monic, by Problem 4.] 

5. Show that in a category with 0, every monic has a kernel and every epic has a cokernel. 


6. Show that if i: C — A and 7: D — A are kernels of f: A — B, then C and D are 
isomorphic. (By duality, a similar statement holds for cokernels.) 

7. Let f: A— B bea group homomorphism with kernel kK, and assume f not injective, 
so that kK 4 {1}. Let g be the inclusion map of K into A. Find a homomorphism h 
such that fg = fh but g FA. 


10.2. PRODUCTS AND COPRODUCTS 5 


8. It follows from Problem 7 that in the category of groups, f monic is equivalent to f 
injective as a mapping of sets, and a similar proof works in the category of modules. 
Why does the argument fail in the category of rings? 


9. Continue from Problem 8 and give a proof that does work in the category of rings. 


10. Let f: M — N be a module homomorphism with nonzero cokernel, so that f is not 
surjective. Show that f is not epic; it follows that epic is equivalent to surjective in 
the category of modules. 


10.2 Products and Coproducts 


We have studied the direct product of groups, rings, and modules. It is natural to try 
to generalize the idea to an arbitrary category, and a profitable approach is to forget 
(temporarily) the algebraic structure and just look at the cartesian product A = []; A; 
of a family of sets A;,i € I. The key property of a product is that if we are given maps 
f; from a set S into the factors A;, we can lift the f; into a single map f: S — |], Ai. 
The commutative diagram below will explain the terminology. 


Yt : 


oer a 


In the picture, p; is the projection of A onto the i” factor A;. If f;(x) = a;,i € I, we 
take f(x) = (a;,i € I). It follows that p;o f = f; for all i; this is what we mean by lifting 
the f; to f. (Notice that there is only one possible lifting, i.e., f is unique.) If A is the 
direct product of groups A; and the f; are group homomorphisms, then f will also be a 
group homomorphism. Similar statements can be made for rings and modules. We can 
now give a generalization to an arbitrary category. 


10.2.1 Definition 


A product of objects A; in a category C is an object A, along with morphisms p;: A — Aj, 
with the following universal mapping property. Given any object S of C and morphisms 
fi: S — Aj, there is a unique morphism f: S — A such that p;f = f; for all i. 

In a definition via a universal mapping property, we use a condition involving mor- 
phisms, along with a uniqueness statement, to specify an object and morphisms associated 
with that object. We have already seen this idea in connection with kernels and cokernels 
in the previous section, and in the construction of the tensor product in Section 8.7. 

Not every category has products (see Problems 1 and 2), but if they do exist, they are 
essentially unique. (The technique for proving uniqueness is also essentially unique.) 


10.2.2 Proposition 


If (A,p;,7 € I) and (B,q,i € I) are products of the objects A;, then A and B are 
isomorphic. 
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Proof. We use the above diagram (1) with S = Band f; = q; to get amorphism f: B > A 
such that p;f = q; for all 7. We use the diagram with S = A, A replaced by B, p; replaced 
by q, and f; = p;, to get a morphism h: A > B such that q;h = p;. Thus 


pifh=qh=p, and ghf =pif = 4. 
But 
Pila=p, and qglap=4q 


and it follows from the uniqueness condition in (10.2.1) that fh = 14 and hf = lg. 
Formally, we are using the diagram two more times, once with S = A and f; = p;, and 
once with S = B, A replaced by B, p; replaced by q;, and f; = q. Thus A and B are 
isomorphic. @& 


The discussion of diagram (1) indicates that in the categories of groups, abelian groups, 
rings, and R-modules, products coincide with direct products. But a category can have 
products that have no connection with a cartesian product of sets; see Problems 1 and 2. 
Also, in the category of torsion abelian groups (torsion means that every element has 
finite order), products exist but do not coincide with direct products; see Problem 5. 

The dual of a product is a coproduct, and to apply duality, all we need to do is reverse 
all the arrows in (1). The following diagram results. 


y | ° 


N Te M 
We have changed the notation because it is now profitable to think about modules. Sup- 
pose that M is the direct sum of the submodules M,, and 7; is the inclusion map of M; 
into M. Ifthe f; are module homomorphisms out of the factors M; and into a module 
N, the f; can be lifted to a single map f. If xj; € Mj1,...,2jr € Mjr, we take 


Fltji te + 25) = fpilegi) + +++ + fir(zir)- 


Lifting means that foi; = f; for all 7. We can now give the general definition of coproduct. 


10.2.3. Definition 


A coproduct of objects M; in a category C is an object M, along with morphisms 
i;: M; — M, with the following universal mapping property. Given any object N of C 
and morphisms f;: M; — N, there is a unique morphism f: M — N such that fi; = f; 
for all 7 

Exactly as in (10.2.2), any two coproducts of a given collection objects are isomorphic. 

The discussion of diagram (2) shows that in the category of R-modules, the coproduct 
is the direct sum, which is isomorphic to the direct product if there are only finitely many 
factors. In the category of sets, the coproduct is the disjoint union. To explain what 
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this means, suppose we have sets A;,j € J. We can disjointize the A; by replacing A; 
by Aj = {(2,j): «© € Aj}. The coproduct is A = Uj<), Aj, with morphisms i;: Aj > A 
given by i;(a;) = (a;,7). If for each 7 we have f;: A; — B, we define f: A — B by 
£ (45,5) = Fi(a). 


The coproduct in the category of groups will be considered in the exercises. 


Problems For Section 10.2 


1. 


Let S be a preordered set, that is, there is a reflexive and transitive relation < on S. 
Then S can be regarded as a category whose objects are the elements of S. If x < y, 
there is a unique morphism from zx to y, and if x < y, there are no morphisms from 
x to y. Reflexivity implies that there is an identity morphism on «, and transitivity 
implies that associativity holds. Show that a product of the objects x;, if it exists, 
must be a greatest lower bound of the x;. The greatest lower bound will be unique 
(not just essentially unique) if S is a partially ordered set, so that < is antisymmetric. 


2. Continuing Problem 1, do products always exist? 


3. Continuing Problem 2, what can be said about coproducts? 
4. If A is an abelian group, let T(A) be the set of torsion elements of A. Show that T(A) 


is a subgroup of A. 


Show that in the category of torsion abelian groups, the product of groups A; is 
T([] A:), the subgroup of torsion elements of the direct product. 


Assume that we have a collection of groups G;, pairwise disjoint except for a common 
identity 1. The free product of the G; (notation *;G;) consists of all words (finite 
sequences) @,---a@, where the a; belong to distinct groups. Multiplication is by con- 
catenation with cancellation. For example, with the subscript 7 indicating membership 
in G;, 


(a1a24304)(b4b2b¢b1 3) — a1 4203(a4b4)b2b6b1b3 


and if b4 = Git this becomes a,a2a3b2bgb,b3. The empty word is the identity, and 
inverses are calculated in the usual way, as with free groups (Section 5.8). In fact a 
free group on S is a free product of infinite cyclic groups, one for each element of S. 
Show that in the category of groups, the coproduct of the G; is the free product. 


Suppose that products exist in the category of finite cyclic groups, and suppose that 
the cyclic group C' with generator a is the product of the cyclic groups C; and C2 with 
generators a, and az respectively. Show that the projections p; and p2 associated with 
the product of C; and C3 are surjective. 

By Problem 7, we may assume without loss of generality that p;(a) = a;, i = 1,2. 
Show that for some positive integer n, na, = a; and nag = 0. [Take fi: Ci — Ci to 
be the identity map, and let fg: C1 — C2 be the zero map (using additive notation). 
Lift f; and fo to f: Cy > C)] 

Exhibit groups C; and C2 that can have no product in the category of finite cyclic 
groups. 
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10.3. Functors 


We will introduce this fundamental concept with a concrete example. Let Homr(M, N) 
be the set of R-module homomorphisms from M to N. As pointed out at the beginning 
of Section 4.4, Homr(M, N) is an abelian group. It will also be an R-module if R is a 
commutative ring, but not in general. We are going to look at Homr(M, N) as a function 
of N, with M fixed. 


10.3.1 The Functor Homp,(M, —) 


We are going to construct a mapping from the category of R-modules to the category of 
abelian groups. Since a category consists of both objects and morphisms, our map will 
have two parts: 


(i) Associate with each R-module N the abelian group Homr(M, N). 


(ii) Associate with each R-module homomorphism h: N — P a homomorphism h,. from 
the abelian group Homr(M,N) to the abelian group Homr(M,P). The following 
diagram suggests how h, should be defined. 


feet eee 
Take 
half) = hf. 
Note that if h is the identity on N, then h, is the identity on Hompr(M, N). 
Now suppose we have the following situation: 


A ey gery 2 aL 


Then (hg)«(f) = (hg) f = h(gf) = hs (gs (f)), so that 


To summarize, we have a mapping F' called a functor that takes an object A in a category C 
to an object F(A) in a category D; F also takes a morphism h: A — B inC to a morphism 
h, = F(h): F(A) > F(B) in D. The key feature of F' is the functorial property: 


F(hg) = F(h)F(g) and F(1a) = 1pva)- 
Thus a functor may be regarded as a homomorphism of categories. 


10.3.2 The Functor Hompa(—, NV) 


We now look at Homr(M, N) as a function of M, with N fixed. Here is an appropriate 
diagram: 


10.3. FUNCTORS 9 


If M is an R-module, we take F'(M) to be the abelian group Homa(M,N). Ifh: L > M 
is an R-module homomorphism, we take h* = F(h) to be a homomorphism from the 
abelian group Homr(M, N) to the abelian group Homr(L, N), given by 


It follows that 


hence 
(hg)* = g°h’, 


and if h is the identity on M, then h* is the identity on Homr(M, N). 

Thus F' does not quite obey the functorial property; we have F(hg) = F(g)F(h) 
instead of F(hg) = F(h)F(g). However, F' is a legal functor on the opposite category 
of R-mod. In the literature, Homr(—,N) is frequently referred to as a contravariant 
functor on the original category R-mod, and Homr(M,—) as a covariant functor on 
R-mod. 

If we replace the category of R-modules by an arbitrary category, we can still define 
functors (called hom functors) as in (10.3.1) and (10.3.2). But we must replace the 
category of abelian groups by the category of sets. 


10.3.3. The Functors M ®r— and —®r N 


To avoid technical complications, we consider tensor products of modules over a commu- 
tative ring R. First we discuss the tensor functor T = M ®@pr — The relevant diagram is 
given below. 


woe pS G 


If N is an R-module, we take T(N) = M @r N. If g: N > P is an R-module homomor- 
phism, we set T(g) =1w @g: M@®rN > M pe P, where 1, is the identity mapping 
on M [Recall that (liz ® g)(x# @ y) = % ® g(y).] Then 


T(f9) =1m ® fg = (1m ® f)(1m ® g) = T(f)T(9) 


and 
T(1n) = 1rv) 


so T is a functor from R-mod to R-mod. 

The functor S = — ®@R N is defined in a symmetrical way. If M is an R-module, then 
S(M) = M®RN, andif f: L > M isan R-module homomorphism, then $(f): L@krN 
M @R N is given by S(f) = f @ 1n. 
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10.3.4. Natural Transformations 


Again we will introduce this idea with an explicit example. The diagram below summarizes 
the data. 


rs| |r (1) 


We start with abelian groups A and B and a homomorphism f: A — B. We apply 
the forgetful functor, also called the underlying functor U. This is a fancy way of saying 
that we forget the algebraic structure and regard A and B simply as sets, and f as 
a mapping between sets. Now we apply the free abelian group functor F to produce 
F(A) = FUu(A), the free abelian group with A as basis (and similarly for F(B)). Thus 
F(A) is the direct sum of copies of Z, one copy for each a € A. The elements of F(A) 
can be represented as )>, n(a)x(a), n(a) € Z, where x(a) is the member of the direct 
sum that is 1 in the a” position and 0 elsewhere. [Similarly, we represent elements of B 
as }>,,n(b)y(b).] The mapping f determines a homomorphism Ff: F(A) > F(B), via 
¥ m(a)a(a) —» Yn(a)y(f(a)). 

Now let G be the identity functor, so that G(A) = A,G(B) = B,Gf = f. We define 
an abelian group homomorphism t4: F(A) — A by ta(> n(a)a(a)) = So n(a)a, and 
similarly we define tg(>_n(b)y(b)) = >> n(b)b. (Remember that we began with abelian 
groups A and B.) The diagram (1) is then commutative, because 


fta(w(a)) = f(a) and tal(F'f)(2(a))| = ta(y(f(@)) = f(a). 


To summarize, we have two functors, F' and G from the category C to the category D. 
(In this case, C = D = the category of abelian groups.) For all objects A,B € C and 
morphisms f: A — B, we have morphisms t4: F(A) — G(A) and tg: F(B) > G(B) 
such that the diagram (1) is commutative. We say that ¢ is a natural transformation from 
F to G. If for every object C € C, tc is an isomorphism (not the case in this example), t 
is said to be a natural equivalence. 

The key intuitive point is that the process of going from F(A) to G(A) is “natural” 
in the sense that as we move from an object A to an object B, the essential features of 
the process remains the same. 


Problems For Section 10.3 


1. Let F': S — T, where S and T are preordered sets. If we regard S' and T as categories, 
as in Section 10.2, Problem 1, what property must F' have in order to be a functor? 


2. A group may be regarded as a category with a single object 0, with a morphism for 
each element g € G. The composition of two morphisms is the morphism associated 
with the product of the elements. If F: G — A is a function from a group G toa 
group H, and we regard G and H as categories, what property must F' have in order 
to be a functor? 
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3. We now look at one of the examples that provided the original motivation for the 
concept of a natural transformation. We work in the category of vector spaces (over a 
given field) and linear transformations. If V is a vector space, let V* be the dual space, 
that is, the space of linear maps from V to the field of scalars, and let V** be the dual 
of V*. Ifu EV, let 0 ¢ V*™ be defined by u(f) = f(v), f ¢ V*. The mapping from v 
to U is a linear transformation, and in fact an isomorphism if V is finite-dimensional. 


Now suppose that f: V — W and g: W — X are linear transformations. Define 
f*: W* = V* by f*(a) =af, a € W*. Show that (gf)* = f*g*. 


4. The double dual functor takes a vector space V into its double dual V**, and takes a 
linear transformation f: V — W to f**: V** — W**, where f**(vu**) = v** f*. Show 
that the double dual functor is indeed a functor. 


5. Now consider the following diagram. 


i| ie 


w 


We take ty(v) = 7%, and similarly for tw. Show that the diagram is commutative, so 
that ¢t is a natural transformation from the identity functor to the double dual functor. 


In the finite-dimensional case, we say that there is a natural isomorphism between a 
vector space and its double dual. “Natural” means coordinate-free in the sense that it 
is not necessary to choose a specific basis. In contrast, the isomorphism of V and its 
single dual V* is not natural. 


6. We say that D is a subcategory of C if the objects of D are also objects of C, and 
similarly for morphisms (and composition of morphisms). The subcategory D is full if 
every C-morphism f: A — B, where A and B are objects of D (the key point) is also 
a D-morphism. Show that the category of groups is a full subcategory of the category 
of monoids. 


7. A functor F': C — D induces a map from C-morphisms to D-morphisms; f: A — B 
is mapped to Ff: FA — FB. If this map is injective for all objects A,B of C, we 
say that F is faithful. If the map is surjective for all objects A, B of C, we say that F 
is full. 


(a) The forgetful functor from groups to sets assigns to each group its underlying 
set, and to each group homomorphism its associated map of sets. Is the forgetful 
functor faithful? full? 


(b) We can form the product C x D of two arbitrary categories; objects in the product 
are pairs (A, A’) of objects, with A € C and A’ € D. A morphism from (A, A’) to 
(B, B’) is a pair (f,g), where f: A > B and g: A’ > B’. The projection functor 
from C x D to C takes (A, A’) to A and (f,g) to f. Is the projection functor 
faithful? full? 
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10.4. Exact Functors 


10.4.1 Definitions and Comments 


We are going to investigate the behavior of the hom and tensor functors when presented 
with an exact sequence. We will be working in the categories of modules and abelian 
groups, but exactness properties can be studied in the more general setting of abelian 
categories, which we now describe very informally. 

In any category C, let Home(A, B) (called a “hom set”) be the set of morphisms in C 
from A to B. [As remarked in (10.1.1), the formal definition of a category requires that 
Home (A, B) be a set for all objects A and B. The collection of all objects of C is a class 
but need not be a set.] For C to be an abelian category, the following conditions must be 
satisfied. 


. Each hom set is an abelian group. 
. The distributive laws f(g +h) = fg+ fh, (f +g)h = fh+ gh hold. 


. C has a zero object. 


er wn 


. Every finite set of objects has a product and a coproduct. (The existence of finite 
coproducts can be deduced from the existence of finite products, along with the re- 
quirements listed so far.) 


. Every morphism has a kernel and a cokernel. 
. Every monic is the kernel of its cokernel. 


. Every epic is the cokernel of its kernel. 


oN D Oo 


. Every morphism can be factored as an epic followed by a monic. 


Exactness of functors can be formalized in an abelian category, but we are going to 
return to familiar ground by assuming that each category that we encounter is R-mod 
for some R. When R = Z, we have the category of abelian groups. 


10.4.2 Left Exactness of Homr(M, —) 


Suppose that we have a short exact sequence 


jpeg a py (1) 


We apply the covariant hom functor F = Hompr(M, —) to the sequence, dropping the last 
term on the right. We will show that the sequence 


F F 
0—>+ FraA— rp Fo (2) 


is exact. A functor that behaves in this manner is said to be left exact. 
We must show that the transformed sequence is exact at FA and F'B. We do this in 
three steps. 


(a) Ff is monic. 
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Suppose that (F'f)(a) = fa = 0. Since f is monic (by exactness of the sequence (1)), 
a = 0 and the result follows. 


(b) im Ff Cker Fg. 


If 6 €imFf, then 6 = fa for some a € Homr(M, A). By exactness of (1), im f C 
ker g, so g8 = gfa=0a=0. Thus f € ker g. 


(c) ker Fg CimF'f. 


If 6 € ker Fg, then g8 = 0, with 6 € Homr(M,B). Thus if y € M, then B(y) € 
ker g = im f, so G(y) = f(x) for some x = a(y) € A. Note that x is unique since f is 
monic, and a € Homr(M, A). Thus G= facimFlf. & 


10.4.3 Left Exactness of Homr(—, NV) 


The contravariant hom functor G = Homr(—, N) is a functor on the opposite category, so 
before applying it to the sequence (1), we must reverse all the arrows. Thus left-exactness 
of G means that the sequence 


0 >GC >GB >GA (3) 


is exact. Again we have three steps. 
(a) Gg is monic. 

If (Gg)a = ag = 0, then a = 0 since g is epic. 
(b) im Gg CkerGf. 


If 6 €imGg, then @ = ag for some a € Homa(C,N). Thus (Gf)3 = Bf = agf = 0, 
so 0 € kerGf. 


(c) kerGf CimGg. 


Let 6 € Hompr(B, N) with G € ker Gf, that is, Gf =0. If y € C, then since g is epic, 
we have y = g(x) for some x € B. If g(a) = g(x2), then x1 — x2 € kerg = im f, hence 
XL, — #2 = f(z) for some z € A. Therefore G(x1) — B(a2) = G(f(z)) = 0, so it makes sense 
to define a(y) = G(x). Then a € Homr(C, N) and ag = £, that is, (Gg)a= 8. & 


10.4.4 Right Exactness of the Functors M®rp— and —®rpN 


If we apply the functor H = M @p — to the exact sequence 


0—>+Aa—> p—*>c—+0 
[see (1) of (10.4.2)], we will show that the sequence 


A A 
HA pS sh (4) 


is exact. A similar result holds for — @r N. A functor that behaves in this way is said to 
be right exact. Once again, there are three items to prove. 
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(i) Hg is epic. 


An element of M ® C is of the form t = }0,2; ® y with x; € M and y% € C. 
Since g is epic, there exists z; € B such that g(z:) = y:. Thus (1 ® g)()0, 2; ® x) = 
yo; 21 @ g(%i) = t. 


(ii) im Hf C ker Hg. 
This is a brief computation: (1® g9)(1@ f) =1@g9f=1@0=0. 
(iii) ker Hg Cim Af. 


By (ii), the kernel of 1 @ g contains L = im(1® f), so by the factor theorem, there is a 
homomorphism 7: (M @pr B)/L— M ®RC such that g/m @b+ L) =m@g(b),meM, 
be B. 

Let a be the canonical map of M ®r B onto (M @r B)/L. Then gx(m ®@ b) = 
g(m ® b+ L) = m® g(b), so 


gm =1@Q. 
If we can show that g is an isomorphism, then 
ker(1 ® g) = ker(G7) = kerm# = L=im(1® f) 


and we are finished. To show that g is an isomorphism, we will display its inverse. First 
let h be the bilinear map from M x C to (M @pr B)/L given by h(m,c) = m@b+L, where 
g(b) = c. [Such a b exists because g is epic. If g(b) = g(b’) =, then b—b' Ekerg=imf, 
so b— 0b! = f(a) for somea€ A. Thenm®b-—meb'=m@® f(a)=(1@8 f)(m@a) EL, 
and h is well-defined.]| By the universal mapping property of the tensor product, there is 
a homomorphism h: M @p C — (M ®p B)/L such that 


h(m® c) = h(m,c) =m ®@ b+ L, where g(b) = c. 
But g: (M @r B)/L > M ®RC and 
g(m @ b+ L)=m@g(b)=mM@c. 


Thus h is the inverse of 7. & 


10.4.5 Definition 


A functor that is both left and right exact is said to be exact. Thus an exact functor is 
one that maps exact sequences to exact sequences. We have already seen one example, 
the localization functor (Section 8.5, Problems 4 and 5). 

If we ask under what conditions the hom and tensor functors become exact, we are 
led to the study of projective, injective and flat modules, to be considered later in the 
chapter. 
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Problems For Section 10.4 


In Problems 1-3, we consider the exact sequence (1) of (10.4.2) with R = Z, so that we 
are in the category of abelian groups. Take A = Z, B = Q, the additive group of rational 
numbers, and C' = Q/Z, the additive group of rationals mod 1. Let f be inclusion, and g 
the canonical map. We apply the functor F = Homr(M, —) with M = Z». [We will omit 
the subscript R when R = Z, and simply refer to Hom(M, —).] 


1. Show that Hom(Z2, Q) = 0. 
2. Show that Hom(Z2, Q/Z) # 0. 
3. Show that Hom(Z2, —) is not right exact. 
In Problems 4 and 5, we apply the functor G = Hom(—,N) to the above exact 
sequence, with N = Z. 
4. Show that Hom(Q, Z) = 0. 
5. Show that Hom(—, Z) is not right exact. 
Finally, in Problem 6 we apply the functor H = M ® — to the above exact sequence, 
6. Show that Zz @ — (and similarly — @ Zz) is not left exact. 
7. Refer to the sequences (1) and (2) of (10.4.2). If (2) is exact for all possible R-modules 
M, show that (1) is exact. 
8. State an analogous result for the sequence (3) of (10.4.3), and indicate how the result 
is proved. 


10.5 Projective Modules 


Projective modules are direct summands of free modules, and are therefore images of 
natural projections. Free modules are projective, and projective modules are sometimes 
but not always free. There are many equivalent ways to describe projective modules, and 
we must choose one of them as the definition. In the diagram below and the definition 
to follow, all maps are R-module homomorphisms. The bottom row is exact, that is, g is 
surjective. 


10.5.1 Definition 


The R-module P is projective if given f: P — N, and g: M — N surjective, there 
exists h: P — M (not necessarily unique) such that the diagram is commutative, that is, 
f = gh. We sometimes say that we have “lifted” f to h. 

The definition may look obscure, but the condition described is a familiar property of 
free modules. 
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10.5.2 Proposition 
Every free module is projective. 


Proof. Let S be a basis for the free module P. By (4.3.6), f is determined by its behavior 
on basis elements s € S. Since g is surjective, there exists a € M such that g(a) = f(s). 
Take h(s) = a and extend by linearity from S to P. Since f = gh on S, the same must 
be true onallof P. & 


Here is the list of equivalences. 


10.5.3. Theorem 

The following conditions on the R-module P are equivalent. 
(1) P is projective. 

(2) The functor Hompr(P, —) is exact. 

(3) Every short exact sequence 0 — M — N > P — 0 splits. 


(4) P is a direct summand of a free module. 


Proof. (1) is equivalent to (2). In view of the left exactness of F = Hompr(P, —) (see 
(10.4.2)), (2) says that if g: M — N is surjective, then so is Fg: FM — FN. But Fg 
maps h: P — M to gh: P — N, so what we must prove is that for an arbitrary morphism 
f: P—N, there exists h: P — M such that gh = f. This is precisely the definition of 
projectivity of P. 

(2) implies (3). Let 0 - M — N > P— 0 bea short exact sequence, with g: N — P 
(necessarily surjective). Since P is projective, we have the following diagram. 


ae 


Mae 


Thus there exists h: P — N such that gh = 1p, which means that the exact sequence 
splits (see (4.7.1)). 

(3) implies (4). By (4.3.6), P is a quotient of a free module, so there is an exact 
sequence 0 > M — N — P > 0 with N free. By (3), the sequence splits, so by (4.7.4), 
P is a direct summand of N. 

(4) implies (1). Let P be a direct summand of the free module F’, and let 7 be the 
natural projection of F' on P; see the diagram below. 


FP— =p 
1 | 
MN 0 


Given f: P > N, we have fa: F — N, so by (10.5.2) there exists h: F — M such that 
fa = gh. If h’ is the restriction of h to P, then f = gh’. & 
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10.5.4 Corollary 
The direct sum P = @P; is projective if and only if each P; is projective. 


Proof. If P is a direct summand of a free module, so is each P;, and therefore the P; 
are projective by (4) of (10.5.3). Conversely, assume that each P; is projective. Let 
f: P — N and g: M — N, with g surjective. If 7; is the inclusion map of P; into P, 
then fi;: P; — N can be lifted to hj: P; — M such that fz; = gh;. By the universal 
mapping property of direct sum (Section 10.2), there is a morphism h: P — M such that 
hi; = h,; for all 7. Thus fi; = ghi; for every 7, and it follows from the uniqueness part of 
the universal mapping property that f= gh. d& 


If we are searching for projective modules that are not free, the following result tells 
us where not to look. 


10.5.5 Theorem 


A module M over a principal ideal domain R is projective if and only if it is free. 


Proof. By (10.5.2), free implies projective. If M is projective, then by (4) of (10.5.3), 
is a direct summand of a free module. In particular, M is a submodule of a free module, 
hence is free by (4.6.2) and the discussion following it. & 


10.5.6 Examples 


1. A vector space over a field k is a free k-module, hence is projective. 
2. A finite abelian group G is not a projective Z-module, because it is not free. [If g € G 
and n = |G|, then ng = 0, so g can never be part of a basis. ] 


3. If p and q are distinct primes, then R = Zpg = Zp © Zq. We claim that Z, and Zy 
are projective but not free R-modules. (As in Example 2, they are not projective 
Z-modules.) This follows from (4) of (10.5.3) and the fact that any ring R is a free 
R-module (with basis {1}). 


Problems For Section 10.5 


In Problems 1—5, we are going to prove the projective basis lemma, which states that an R- 
module P is projective if and only if there are elements x; € P (i € I) and homomorphisms 
fi: P — R such that for every x € P, f;(x) = 0 for all but finitely many 7 and 


The set of x,;’s is referred to as the projective basis. 


1. To prove the “only if” part, let P be a direct summand of the free module F with basis 
{e;}. Take f to be the inclusion map of P into F’, and z the natural projection of F 
onto P. Show how to define the f; and x; so that the desired results are obtained. 
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2. To prove the “if” part, let F be a free module with basis {e;,i € I}, and define 
a: F — P by r(e;) = 2. Define f: P > F by f(x) = 05; fi(x)ex. Show that af is the 
identity on P. 

3. Continuing Problem 2, show that P is projective. 


4. Assume that P is finitely generated by n elements. If R” is the direct sum of n copies 
of R, show that P is projective iff P is a direct summand of R”. 


5. Continuing Problem 4, show that if P is projective and generated by n elements, then 
P has a projective basis with n elements. 


6. Show that a module P is projective iff P is a direct summand of every module of 
which it is a quotient. In other words, if P = M/N, then P is isomorphic to a direct 
summand of M. 


7. In the definition (10.5.1) of a projective module, give an explicit example to show that 
the mapping hf need not be unique. 


10.6 Injective Modules 


If we reverse all arrows in the mapping diagram that defines a projective module, we 
obtain the dual notion, an injective module. In the diagram below, the top row is exact, 
that is, f is injective. 


10.6.1 Definition 


The R-module E is injective if given g: N — E, and f: N — M injective, there exists 
h: M — E (not necessarily unique) such that g = hf. We sometimes say that we have 
“lifted” g to h. 

As with projectives, there are several equivalent ways to characterize an injective 
module. 


10.6.2 Theorem 

The following conditions on the R-module EF are equivalent. 
(1) E is injective. 

(2) The functor Homr(—, FE) is exact. 

(3) Every exact sequence 0 — E + M — N — 0 splits. 


Proof. (1) is equivalent to (2). Refer to (3) of (10.4.3), (1) of (10.4.2) and the definition 
of the contravariant hom functor in (10.3.2) to see what (2) says. We are to show that 
if f: N — M is injective, then f*: Homry(M,E) — Homp(N, £) is surjective. But 
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f*(h) = hf, so given g: N — E, we must produce h: M — E such that g = hf. This is 
precisely the definition of injectivity. 

(2) implies (3). Let 0 - E — M — N — 0 be ashort exact sequence, with f: HE — M 
(necessarily injective). Since E is an injective module, we have the following diagram: 


jy 


A 


E 


Thus there exists g: M — FE such that gf = 1z, which means that the exact sequence 
splits. 

(3) implies (1). Given g: N — E, and f: N — M injective, we form the pushout of f 
and g, which is a commutative square as indicated in the diagram below. 


N M 
| |» 
E Q 


Detailed properties of pushouts are developed in the exercises. For the present proof, all 
we need to know is that since f is injective, so is f’. Thus the sequence 


J; 


a 


——_ 
bm 


0 —+ #“+Q—+Q/im Ff’ —+0 


is exact. By (3), there exists h: Q — E such that hf’ = 1”. We now have hg’: M > E 
with hg’ f = hf'g =1eg = g, proving that F is injective. & 


We proved in (10.5.4) that a direct sum of modules is projective iff each component 
is projective. The dual result holds for injectives. 


10.6.3. Proposition 


A direct product |] js of modules is injective iff each E; is injective. Consequently, a 
finite direct sum is injective iff each summand is injective. 


Proof. If f: N — M is injective and g: N — |], Fi, let 9: = pig, where p; is the projection 
of the direct product on E;. Then finding h: M — J], E; such that g = hf is equivalent 
to finding, for each i, a morphism h;: M — E; such that g; = hif. [If pig = hif = pihf 
for every i, then g = hf by the uniqueness part of the universal mapping property for 
products.] The last assertion holds because the direct sum of finitely many modules 
coincides with the direct product. & 


In checking whether an R-module F is injective, we are given g: N — E, and 
f: N — M, with f injective, and we must lift g toh: M — E with g = hf. The 
next result drastically reduces the collection of maps f and g that must be examined. We 
may take M = R and restrict N to a left ideal J of R, with f the inclusion map. 
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10.6.4. Baer’s Criterion 


The R-module E is injective if and only if every R-homomorphism f: I — EF, where I is 
a left ideal of R, can be extended to an R-homomorphism h: R — E. 


Proof. The “only if” part follows from the above discussion, so assume that we are given 
g: N > E and f: N — M, where (without loss of generality) f is an inclusion map. 
We must extend g to h: M — E. A standard Zorn’s lemma argument yields a maximal 
extension go in the sense that the domain Mo of go cannot be enlarged. [The partial 
ordering is (g1,D1) < (g2,De2) iff Di C De and gi = go on D,.] If Mo = M, we are 
finished, so assume x € M \ Mo. Let I be the left ideal {r € R: rx € Mo}, and define 
ho: I — E by ho(r) = go(rx). By hypothesis, ho can be extended to hg: R — E. Let 
M, = Mo + Ra and define hy: M, — E by 


hi(ao + rx) = go(xo) + rho(1). 


To show that h; is well defined, assume rg + rx = yo + sx, with ro, yo € Mo and r,s € R. 
Then (r — s)a% = yo — 0 € Mo, sor—s € J. Using the fact that hj extends ho, we have 


Go(Yo — 0) = go((r — s)") = ho(r — 8) = ho(r — 8) = (r — s)ho(1) 


and consequently, go(%o) + rho(1) = go(yo) + sho(1) and hy is well defined. If x9 € Mo, 
take r = 0 to get hi(xo) = go(%o), so hi is an extension of go to M; D Mo, contradicting 
the maximality of go. We conclude that Mo = M. dh 


Since free modules are projective, we can immediately produce many examples of 
projective modules. The primary source of injective modules lies a bit below the surface. 


10.6.5 Definitions and Comments 


Let R be an integral domain. The R-module M is divisible if each y € M can be divided 
by any nonzero element r € R, that is, there exists « € M such that rx = y. For 
example, the additive group of rational numbers is a divisible abelian group, as is Q/Z, 
the rationals mod 1. The quotient field of any integral domain (regarded as an abelian 
group) is divisible. A cyclic group of finite order n > 1 can never be divisible, since it is 
not possible to divide by n. The group of integers Z is not divisible since the only possible 
divisors of an arbitrary integer are +1. It follows that a nontrivial finitely generated 
abelian group, a direct sum of cyclic groups by (4.6.3), is not divisible. 

It follows from the definition that a homomorphic image of a divisible module is 
divisible, hence a quotient or a direct summand of a divisible module is divisible. Also, a 
direct sum of modules is divisible iff each component is divisible. 


10.6.6 Proposition 


If R is any integral domain, then an injective R-module is divisible. If R is a PID, then 
an R-module is injective if and only if it is divisible. 
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Proof. Assume E is injective, and let y © FE, r € R, r #0. Let I be the ideal Rr, 
and define an R-homomorphism f: J — E by f(tr) = ty. If tr = 0, then since R is 
an integral domain, t = 0 and f is well defined. By (10.6.4), f has an extension to an 
R-homomorphism h: R — E. Thus 


y = f(r) = Ar) = Ard) = rh(1) 


so division by r is possible and F is divisible. Conversely, assume that R is a PID and 
E is divisible. Let f: I — E, where I is an ideal of R. Since R is a PID, J = Rr for 
some r € R. We have no trouble extending the zero mapping, so assume r 4 0. Since EF 
is divisible, there exists x € E such that rr = f(r). Define h: R > E by h(t) = ta. If 
t € R, then 


h(tr) = tra =tf(r) = f(tr) 


so h extends f, proving F injective. d& 


Problems For Section 10.6 


We now describe the construction of the pushout of two module homomorphisms f: A—C 
and g: A — B; refer to Figure‘10.6.1. Take 


D=(B@C)/W, where W = {(g(a), —f(a)): a € A}, 
and 
g'(c) =(0,c)+W,  f’(b) = (0,0) + W. 


In Problems 1-6, we study the properties of this construction. 


A360 
|e 
Ba Sp . 
e 
E 


Figure 10.6.1 


1. Show that the pushout square ACDB is commutative, that is, f’g = g’f. 


2. Suppose we have another commutative pushout square ACE B with maps f”: B— E 
and g”: C — E, as indicated in Figure 10.6.1. Define h: D > E by 


h((b,c) + W) = g"(o) + f"(0). 


Show that h is well defined. 
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. Show that h makes the diagram commutative, that is, hg’ = g” and hf’ = f”. 
. Show that if h’: D — E makes the diagram commutative, then h’ = h. 


The requirements stated in Problems 1, 3 and 4 can be used to define the pushout 
via a universal mapping property. The technique of (10.2.2) shows that the pushout 
object D is unique up to isomorphism. 


. If f is injective, show that f’ is also injective. By symmetry, the same is true for g 


and g’. 


. If f is surjective, show that jf’ is surjective. By symmetry, the same is true for 
J y Sy y: g 


and g’. 


Problems 7-10 refer to the dual construction, the pullback, defined as follows (see 


Figure 10.6.2). Given f: A > B and g: C > B, take 


and 


LT, 


12. 


D={(a,c) € AGC: fla) = g(o)} 


D—~A 
g 
f [1 


C—>B 


Figure 10.6.2 


. Show that the pullback square DABC is commutative, that is, fg’ = gf’. 
. If we have another commutative pullback square EABC with maps f”: FE = C 


and g”: FE — A, show that there is a unique h: E — D that makes the diagram 
commutative, that is, g’/h = g” and f/h= f”. 


. If f is injective, show that f’ is injective. By symmetry, the same is true for g and g’. 
10. 


If f is surjective, show that f’ is surjective. By symmetry, the same is true for g 
and g’. 

Let R be an integral domain with quotient field Q, and let f be an R-homomorphism 
from an ideal J of R to Q. Show that f(x)/x is constant for all nonzero x € I. 
Continuing Problem 11, show that Q is an injective R-module. 


10.7 Embedding into an Injective Module 


We know that every module is a quotient of a projective (in fact free) module. In this 
section we prove the more difficult dual statement that every module can be embedded 
in an injective module. (To see that quotients and submodules are dual, reverse all the 
arrows in a short exact sequence.) First, we consider abelian groups. 
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10.7.1 Proposition 
Every abelian group can be embedded in a divisible abelian group. 


Proof. If A is an abelian group, then A is a quotient of a free abelian group F’, say 
A= F/B. Now F is a direct sum of copies of Z, hence F' can be embedded in a direct 
sum D of copies of Q, the additive group of rationals. It follows that F/B can be embedded 
in D/B; the embedding is just the inclusion map. By (10.6.5), D/B is divisible, and the 
result follows. & 


10.7.2. Comments 


In (10.7.1), we used Q as a standard divisible abelian group. It would be very desirable 
to have a canonical injective R-module. First, we consider H = Homz(R, A), the set of 
all abelian group homomorphisms from the additive group of the ring R to the abelian 
group A. If we are careful, we can make this set into a left R-module. The abelian group 
structure of H presents no difficulties, but we must also define scalar multiplication. If 
f € Hand s € R, we set 


(sf)(r) = f(rs), re R. 
Checking the module properties is routine except for associativity: 
((ts)f)(r) = f(rts), and (t(sf))(r) = (sf)(rt) = f(rts) 


o (ts)f = t(sf). Notice that sf is an abelian group homomorphism, not an R-module 
homomorphism. 
Now if A is an R-module and B an abelian group, we claim that 


Homz(A, B) = Homr(A, Homz(R, B)), (1) 
equivalently, 
Homz(R @r A, B) = Homr(A, Homz(R, B)). (2) 


This is a special case of adjoint associativity: 
If sMrp, RN, sP, then 


Homs(M ®p N, P) ¥ Homa(N,Homs(M, P)). (3) 
Thus if F is the functor M @pr — and G is the functor Homg(M, —), then 
Homs(FN, P) ¥ Homar(N, GP) (4) 


which is reminiscent of the adjoint of a linear operator on an inner product space. We 
say that F and G are adjoint functors, with F left adjoint to G and G right adjoint 
to F. [There is a technical naturality condition that is added to the definition of adjoint 
functors, but we will not pursue this since the only adjoints we will consider are hom and 
tensor. 
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Before giving the formal proof of (3), we will argue intuitively. The left side of the 
equation describes all biadditive, R-balanced maps from M x N to P. If (2, y) — f(a, y), 
xz € M, y € N, is such a map, then f determines a map g from N to Homgs(M, P), 
namely, g(y)(z) = f(x,y). This is a variation of the familiar fact that a bilinear map 
amounts to a family of linear maps. Now if s € S, then g(y)(sx) = f(sa,y), which need 
not equal sf(x,y), but if we factor f through the tensor product M @pr N, we can then 
pull out the s. Thus g(y) € Homs(M, P). Moreover, g is an R-module homomorphism, 
because g(ry)(x) = f(x, ry), and we can factor out the r by the same reasoning as above. 
Since g determines f, the correspondence between f and g is an isomorphism of abelian 
groups. 

To prove (3), let f: M @prN — P be an S-homomorphism. If y € N, define 
fy: M — P by fy(x) = f(x @ y), and define y(f): N — Homgs(M,P) by y — fy. 
[Homs(M, P) is a left R-module by Problem 1.] 


(a) w(f) is an R-homomorphism: 
OS) 1 + 92) = frrtus = far + fue = VP) (M1) + YF) (92); 
W(f)(ry) = fry =r fy = (rd(F))(y)- 
[fry(x) = f(@ @ ry) and (rfy)(x) = fy(zr) = f(ar @ y),] 


(b) w is an abelian group homomorphism: 
We have f,(z) + g(x) = fle @y) +9(e @y) = (F +92 Ov) = UF + a)y(2) 50 
vf +9) = ¥(f) + (9). 

(c) w is injective: 
If ~(f) = 0, then f, = 0 for all y € N, so f(x @y) = 0 for alla € M andy e N. 
Thus f is the zero map. 

(d) If g € Homr(N, Homs(M, P)), define y,: M x N > P by (x,y) = g(y)(x). Then 
Yq is biadditive and R-balanced: 
By definition, y, is additive in each coordinate, and [see Problem 1] yg(ar,y) = 
gly)(2r) = (rg(y))(@) = g(ry)(®) = Po(a, ry). 

(e) w is surjective: 


By (d), there is a unique S-homomorphism 3(g): M®@rN — P such that 6(g)(r@y) 
yolx,y) = o(y)(a), « € M, y EN. It follows that J(6(g)) = 9, because +4(8(9))(y) = 
B(g)y, where B(g)y(x) = B(g)(z @ y) = g(y)(2). Thus 8(g)y = g(y) for all y in N, so pB 
is the identity and w is surjective. 

It follows that w is an abelian group isomorphism. This completes the proof of (3). 

Another adjointness result, which can be justified by similar reasoning, is that if Np, 
RMs, Pg, then 


I 


Homg(N @p M, P) Y Homp(N,Homs(M, P)) (5) 


which says that F = — ®r M and G = Homg(M, —) are adjoint functors. 


10.7.3. Proposition 
If F is a divisible abelian group, then Homz(R, £) is an injective left R-module. 
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Proof. By (10.6.2), we must prove that Homr(—, Homz(R, E£)) is exact. As in the proof 
of (1) implies (2) in (10.6.2), if 0 - N — M is exact, we must show that 


Homr(M,Homz(R, £)) - Homr(N, Homz(R, E)) — 0 
is exact. By (1) of (10.7.2), this is equivalent to showing that 
Homz(M, EF) — Homz(N, £) — 0 


is exact. [As indicated in the informal discussion in (10.7.2), this replacement is allowable 
because a bilinear map can be regarded as a family of linear maps. A formal proof would 
invoke the naturality condition referred to in (10.7.2).] Since FE is an injective Z-module, 
the result now follows from (10.6.2). & 


We can now prove the main result. 


10.7.4 Theorem 
If M is an arbitrary left R-module, then M can be embedded in an injective left R-module. 


Proof. If we regard M as an abelian group, then by (10.7.1), we can assume that M is a 
subset of the divisible abelian group EF. We will embed M in the injective left R-module 
N = Homz(R, E) (see (10.7.3)). If m € M, define f(m): R — E by f(m)(r) = rm. 
Then f: M — N, and we claim that f is an injective R-module homomorphism. If 
f(mi) = f(m2), then rm, = rm2 for every r € R, and we take r = 1 to conclude 
that m1 = me, proving injectivity. To check that f is an R-homomorphism, note that if 
r,s€ Rand me M, then 


f(sm)(r) = rsm and (sf(m))(r) = f(m)(rs) = rsm 
by definition of scalar multiplication in the R-module N; see (10.7.2). & 


It can be shown that every module M has an injective hull, that is, there is a smallest 
injective module containing M. 


Problems For Section 10.7 

1. If eMs and RN, show that Homr(M, N) is a left S-module via 
(sf)(m) = f(ms). 

2. If rMg and Ng, show that Homs(M, N) is a right R-module via 
(fr)(m) = f(rm). 

3. If Mr and s Nr, show that Homr(M, N) is a left S-module via 


(sf)(m) = sf(m). 
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4. If sM and sNp, show that Homg(M, N) is a right R-module via 
(fr)(m) = f(m)r. 


5. A useful mnemonic device for remembering the result of Problem 1 is that since M 
and WN are left R-modules, we write the function f on the right of its argument. The 
result is m(sf) = (ms) f, a form of associativity. Give similar devices for Problems 2, 3 
and 4. 


Note also that in Problem 1, M is a right S-module, but Homr(M, N) is a left S- 
module. The reversal might be expected, because the hom functor is contravariant in its 
first argument. A similar situation occurs in Problem 2, but in Problems 3 and 4 there is 
no reversal. Again, this might be anticipated because the hom functor is covariant in its 
second argument. 


6. Let R be an integral domain with quotient field Q. If M is a vector space over Q, show 
that M is a divisible R-module. 


7. Conversely, if M is a torsion-free divisible R-module, show that M is a vector space 
over Q. 


8. If R is an integral domain that is not a field, and Q is the quotient field of R, show 
that Homr(Q, R) = 0. 


10.8 Flat Modules 


10.8.1 Definitions and Comments 


We have seen that an R-module M is projective iff its covariant hom functor is exact, 
and M is injective iff its contravariant hom functor is exact. It is natural to investigate 
the exactness of the tensor functor M ®pr —, and as before we avoid complications by 
assuming all rings commutative. We say that M is flat if M @p — is exact. Since the 
tensor functor is right exact by (10.4.4), an equivalent statement is that if f: A — B is 
an injective R-module homomorphism, then 


1@f:M2A>MEB 


is injective. In fact it suffices to consider only R-modules A and B that are finitely 
generated. This can be deduced from properties of direct limits to be considered in the 
next section. [Any module is the direct limit of its finitely generated submodules (10.9.3, 
Example 2). The tensor product commutes with direct limits (Section 10.9, Problem 2). 
The direct limit is an exact functor (Section 10.9, Problem 4).] A proof that does not 
involve direct limits can also be given; see Rotman, “An Introduction to Homological 
Algebra”, page 86. 


10.8.2 Example 


Since Zp @z — is not exact (Section 10.4, Problem 6), Zz is not a flat Z-module. 
The next result is the analog for flat modules of property (10.5.4) of projective modules. 
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10.8.3 Proposition 


The direct sum ©; ; is flat if and only if each M; is flat. 


Proof. Let f: A — B be an injective R-homomorphism. In view of (8.8.6(b)), investigat- 
ing the flatness of the direct sum amounts to analyzing the injectivity of the mapping 


given by 
Li, OAL +--+ + Uji, @An Vj, @ f(ar) +--+ + 24, @ f(an). 


The map g will be injective if and only if all component maps x; @ a; > x; ® f(a;) are 
injective. This says that the direct sum is flat iff each component is flat. d& 


We now examine the relation between projectivity and flatness. 


10.8.4 Proposition 


Ris a flat R-module. 


Proof. If f: A — B is injective, we must show that (1@f): R@RA— Rp B is injective. 
But by (8.7.6), R®eM = M viar®x > rx. Thus the following diagram is commutative. 


1 
Rosh BOER 


A — > B 


f 


Therefore injectivity of 1 ® f is equivalent to injectivity of f, and the result follows. d& 


10.8.5 Corollary 


Every projective module, hence every free module, is flat. 


Proof. By (10.8.3) and (10.8.4), every free module is flat. Since a projective module is a 
direct summand of a free module, it is flat by (10.8.3). & 


Flat abelian groups can be characterized precisely. 
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10.8.6 Theorem 


A Z-module is flat iff it is torsion-free. 


Proof. Suppose that M is a Z-module that is not torsion-free. Let « € M be a nonzero 
element such that nz = 0 for some positive integer n. If f: Z — Z is multiplication by n, 
then (1@ f): M@®Z— M ®Zis given by 


Yy®B@zrySnz=ny&z 


so that (1@ f)(2@1) =nz@1=0. Since x @1 corresponds to x under the isomorphism 
between M ® Zand M,x®1#0, and 1 f is not injective. Therefore M is not flat. 

The discussion in (10.8.1) shows that in checking flatness of M, we can restrict to 
finitely generated submodules of M. [We are examining equations of the form 
(1 ® f)(t) = 0, where t = S>"_, @; @ yj, 4} € M, y; € A.JThus without loss of gener- 
ality, we can assume that M is a finitely generated abelian group. If M is torsion-free, 
then by (4.6.5), M is free and therefore flat by (10.8.5). & 


10.8.7 Corollary 


The additive group of rationals Q is a flat but not projective Z-module. 


Proof. Since Q is torsion-free, it is flat by (10.8.6). If Q were projective, it would be free 
by (10.5.5). This is a contradiction (see Section 4.1, Problem 5). & 


Sometimes it is desirable to change the underlying ring of a module; the term base 
change is used in these situations. 


10.8.8 Definitions and Comments 


If f: R— Sis aring homomorphism and M is an S-module, we can create an R-module 
structure on M by ra = f(r)z, r € R, x € M. This is a base change by restriction of 
scalars. 

If f: R — S is aring homomorphism and M is an R-module, we can make S ®p M 
into an S-module via 


s(s’ @a)=ss8' @a, 5,8'€S, EM. 


This is a base change by extension of scalars. Note that S is an R-module by restriction 
of scalars, so the tensor product makes sense. What we are doing is allowing linear 
combinations of elements of M with coefficients in S. This operation is very common in 
algebraic topology. 

In the exercises, we will look at the relation between base change and flatness. There 
will also be some problems on finitely generated algebras, so let’s define these now. 
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10.8.9 Definition 


The R-algebra A is finitely generated if there are elements a1,...,@n € A such that every 
element of A is a polynomial in the a;. Equivalently, the algebra homomorphism from the 
polynomial ring R[X1,...,X,] — A determined by X; — a;, i = 1,...,n, is surjective. 
Thus A is a quotient of the polynomial ring. 

It is important to note that if A is finitely generated as an R-module, then it is finitely 
generated as an R-algebra. [If a = ria, + -+-+7n@n, then a is certainly a polynomial in 
the a;.| 


Problems For Section 10.8 


1. Give an example of a finitely generated R-algebra that is not finitely generated as an 
R-module. 


2. Show that R[X] @r RY] = R[X,Y]. 
3. Show that if A and B are finitely generated R-algebras, so is A ®pr B. 


4. Let f: RS be a ring homomorphism, and let M be an S-module, so that M is an 
R-module by restriction of scalars. If S$ is a flat R-module and M is a flat S-module, 
show that M is a flat R-module. 


5. Let f: R— S be a ring homomorphism, and let M be an R-module, so that S ®p M 
is an S-module by extension of scalars. If M is a flat R-module, show that S ®pr M is 
a flat S-module. 


6. Let S be a multiplicative subset of the commutative ring R. Show that for any R- 
module M, S-'R@rM & S“!'M via a: (r/s) ®@x > ra/s with inverse 3: x/s > 
(1/s) @a. 

7. Continuing Problem 6, show that S~'R is a flat R-module. 


10.9 Direct and Inverse Limits 


If M is the direct sum of R-modules M;, then R-homomorphisms f;: MM; — N can be lifted 
uniquely to an R-homomorphism f: M— N. The direct limit construction generalizes 
this idea. [In category theory, there is a further generalization called the colimit. The 
terminology is consistent because the direct sum is the coproduct in the category of 
modules. ] 


10.9.1 Direct Systems 


A directed set is a partially ordered set J such that given any i,7 € J, there exists k € I 
such that i << k and j <k. A typical example is the collection of finite subsets of a set, 
ordered by inclusion. If A and B are arbitrary finite subsets, then both A and B are 
contained in the finite set AU B. 

Now suppose I is a directed set and we have a collection of objects A;,i € J, ina 
category C. Assume that whenever 7 < j, there is a morphism h(i,7): A; > A;. Assume 
further that the h(i,7) are compatible in the sense that if i < 7 < k and we apply h(i, 7) 
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followed by h(j,k), we get h(t,k). We also require that for each 7, h(i,2) is the identity 
on A;. The collection of objects and morphisms is called a direct system. As an example, 
take the objects to be the finitely generated submodules of a module, and the morphisms 
to be the natural inclusion maps. In this case, the directed set coincides with the set of 
objects, and the partial ordering is inclusion. 


10.9.2 Direct Limits 


Suppose that {A;,h(7,7), 7,7 € I} is a direct system. The direct limit of the system will 
consist of an object A and morphisms a;: A; — A. Just as with coproducts, we want to 
lift morphisms f;: A; > B toa unique f: A— B, that is, fa; = f; for all 7 €¢ J. But we 
require that the maps a; be compatible with the h(i,7), in other words, a;h(i,j) = a; 
whenever i < j. A similar constraint is imposed on the f;, namely, f;h(t,j) = fi, 1 < 7. 
Thus the direct limit is an object A along with compatible morphisms a;: A; — A such 
that given compatible morphisms f;: A; — B, there is a unique morphism f: A > B 
such that fa; = f; for all 7. Figure 10.9.1 summarizes the discussion. 


nin) “ 


Figure 10.9.1 
As in Section 10.2, any two direct limits of a given direct system are isomorphic. 
If the ordering on I is the equality relation, then the only element j such that i <j 
is 7 itself. Compatibility is automatic, and the direct limit reduces to a coproduct. 
A popular notation for the direct limit is 


The direct limit is sometimes called an inductive limit. 


10.9.3. Examples 


1. A coproduct is a direct limit, as discussed above. In particular, a direct sum of modules 
is a direct limit. 

2. Any module is the direct limit of its finitely generated submodules. [Use the direct 
system indicated in (10.9.1).] 

3. The algebraic closure of a field F can be constructed (informally) by adjoining roots of 
all possible polynomials in F'[X]; see (3.3.7). This suggests that the algebraic closure 
is the direct limit of the collection of all finite extensions of F’. This can be proved 
with the aid of (3.3.9). 


In the category of modules, direct limits always exist. 
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10.9.4 Theorem 


If {M;, h(i, 7),7,7 € I} is a direct system of R-modules, then the direct limit of the system 
exists. 


Proof. Take M to be (@;Mi)/N, with N the submodule of the direct sum generated by 
all elements of the form 


Bhi, 9) ai — Biti, 1S 9, BE Mi (1) 
where (; is the inclusion map of M; into the direct sum. Define a;: M; — M by 
a,x; = Bx; + N. 
The a; are compatible, because 
a,h(t, j)a; = B h(t, ei + N = Bia, + N = aya. 
Given compatible f;: M; — B, we define f: M— B by 
f(Giai t+ N) = fii, 


the only possible choice. This forces fa; = f;, provided we show that f is well-defined. 
But an element of N of the form (1) is mapped by our proposed f to 


Fi ht, Jai — faxi 


which is 0 by compatibility of the f;. Thus f maps everything in N to 0, and the result 
follows. d& 


10.9.5 Inverse Systems and Inverse Limits 


Inverse limits are dual to direct limits. An inverse system is defined as in (10.9.1), 
except that if ¢ < j, then h(i,j) maps “backwards” from A; to A;. If we apply h(j,k) 
followed by h(i,7), we get h(i,k); as before, h(i,z) is the identity on A;. The inverse 
limit of the inverse system {A;,h(i,7),i,7 € I} is an object A along with morphisms 
pi: A — A;. As with products, we want to lift morphisms f;: B — A; to a unique 
f: B — A. There is a compatibility requirement on the p; and fi: if i < j, then 
h(i, j)p; = pi, and similarly h(i, 7) f; = fi. Thus the inverse limit is an object A along with 
compatible morphisms p;: A — A; such that given compatible morphisms f;: B — Aj, 
there is a unique morphism f: B — A such that p;f = f; for all 7. See Figure 10.9.2. 
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Figure 10.9.2 
As in Section 10.2, the universal mapping property determines the inverse limit up to 
isomorphism. 
If the ordering on I is the equality relation, the the inverse limit reduces to a product. 
In category theory, the limit is a generalization of the inverse limit. 
A popular notation for the inverse limit is 


The inverse limit is sometimes called a projective limit. 

We constructed the direct limit of a family of modules by forming a quotient of the 
direct sum. By duality, we expect that the inverse limit involves a submodule of the direct 
product. 


10.9.6 Theorem 


If {M;, h(t, 7),7,7 © I} is an inverse system of R-modules, then the inverse limit of the 
system exists. 


Proof. We take M to be the set of all « = (a;,i € J) in the direct product [],;.M; such 
that h(i,j)x; = x; whenever i < j. Let p; be the restriction to M of the projection on 
the i‘” factor. Then h(i, j)pjz = h(i, j)x; = 2; = pix, so the p; are compatible. Given 
compatible f;: N — Mj, let f be the product of the f;, that is, fa = (fix,i € I). By 
compatibility, h(i, j) fj; = fiw for i < j, so f maps []; M; into M. By definition of f we 
have p;f = f;, and the result follows. d& 


10.9.7 Example 


Recall from Section 7.9 that a p-adic integer can be represented as ag + aip+ agp? +---, 
where the a; belong to {0,1,...,p—1}. If we discard all terms after a,_1p"—+, r = 1,2,..., 
we get the ring Z,r. These rings form an inverse system; if x € Zp» and r < s, we take 
h(r,s)a to be the residue of mod p’. The inverse limit of this system is the ring of 
p-adic integers. 


Problems For Section 10.9 


1. In Theorem (10.9.6), why can’t we say “obviously”, since direct limits exist in the 
category of modules, inverse limits must also exist by duality. 


2. Show that in the category of modules over a commutative ring, the tensor product 
commutes with direct limits. In other words, 


lim (M@N;)=M@lim N; 


assuming that the direct limit of the N; exists. 


3. For each n = 1,2,..., let A, be an R-module, with A, C Ag C Ag C---. Take h(i, 7), 
i <j, to be the inclusion map. What is the direct limit of the A,,? (Be more explicit 
than in (10.9.4).) 
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4. Suppose that A, B and C are the direct limits of direct systems {A;}, {Bi} and {Ci} 
of R-modules. Assume that for each 7, the sequence 


peter: eeeceer oy 


is exact. Give an intuitive argument to suggest that the sequence 


Vee ee eae 


is exact. Thus direct limit is an exact functor. 


[A lot of formalism is being suppressed here. We must make the collection of direct 
systems into a category, and define a morphism in that category. This forces compatibility 
conditions on the f; and g: f;ha(i,7) = heli,a) fi, gihelt,7) = holt, j)gi. The direct 
limit functor takes a direct system to its direct limit, but we must also specify what it 
does to morphisms. ] 

A possible strategy is to claim that since an element of a direct sum has only finitely 
many nonzero components, exactness at B is equivalent to exactness at each B;. This 
is unconvincing because the direct limit is not simply a direct sum, but a quotient of a 
direct sum. Suggestions are welcome! 

Problems 5 and 6 give some additional properties of direct products. 


5. Show that 
Hompr(@Aj, B) = | [ Homa(Aj, B). 


6. Show that 
Hom,(A, | [ Bi) & [| Homa(A, Bi). 


7. If M isa nonzero R-module that is both projective and injective, where R is an integral 
domain that is not a field, show that Homr(M, R) = 0. 

8. Let R be an integral domain that is not a field. If M is an R-module that is both 
projective and injective, show that M = 0. 


Appendix to Chapter 10 


We have seen that an abelian group is injective if and only if it is divisible. In this 
appendix we give an explicit characterization of such groups. 


A10.1 Definitions and Comments 


Let G be an abelian group, and T the torsion subgroup of G (the elements of G of 
finite order). Then G/T is torsion-free, since n(z + T) = 0 implies nz € T, hence 
x € T. If p is a fixed prime, the primary component G, associated with p consists 
of all elements whose order is a power of p. Note that Gp is a subgroup of G, for if 
p'a = p™b = 0, n > m, then p"(a — b) = 0. (We use the fact that G is abelian; for 
example, 3(a — b) =a—b+a—b+a-—b=a+a+a-—b—b-b.) 
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A10.2 Proposition 


The torsion subgroup T’ is the direct sum of the primary components Gp, p prime. 


Proof. Suppose x has order m = Nes Pp; . If m; = m/p;', then the greatest common 
divisor of the m,; is 1, so there are integers a1,...,a, such that aym, +---+ apm, = 1. 
Thus « = lz = eee a;(m;x), and (by definition of m;) m;x has order p;* and therefore 
belongs to the primary component G,,. This proves that G is the sum of the G,. To show 
that the sum is direct, assume 0 #4 x € GpM 1,4, Gq. Then the order of x is a power 
of p and also a product of prime factors unequal to p, which is impossible. For example, 
if y has order 9 and z has order 125, then 9(125)(y + z) = 0, so the order of y + z is of 
the form 3755. & 


A10.3 Definitions and Comments 


A Priifer group, also called a quasicyclic group and denoted by Z(p™), is a p-primary 
component of Q/Z, the rationals mod 1. Since every element of Q/Z has finite order, it 
follows from (A10.2) that 


Q/Z = PHAp~). 


Now an element of Q/Z whose order is a power of p must be of the form a/p" + Z for 
some integer a and nonnegative integer r. It follows that the elements a, = 1/p" + Z, 
r=1,2,..., generate Z(p~). These elements satisfy the following relations: 


pa; = 0, pag = a4,...,PAr+1 = Gp,... 


A10.4 Proposition 


Let H be a group defined by generators b,,b2,... and relations pb; = 0, pbe = 0,..., 
pb;+41 = b,,.... Then H is isomorphic to Z(p~). 


Proof. First note that the relations imply that every element of H is an integer multiple 
of some b;. Here is a typical computation: 


4b7 + 6619 + 2b14 = 4 (pbs) + 6(pbi1) + 2b14 
= +++ = A(p"by4) + 6(p*by4) + 2bia = (4p" + 6p* + 2)bia. 


By (5.8.5), there is an epimorphism f: H — Z(p™), and by the proof of (5.8.5), we can 
take f(b;) = a; for all 7. To show that f is injective, suppose f(cb;) = 0 where c € Z. 
Then cf(b;) = ca; = 0, so c/p' € Z, in other words, p’ divides c. ( We can reverse 
this argument to conclude that f(cb;) = 0 iff p’ divides c.) But the relations imply that 
pb; = 0, and since c is a multiple of p’, we have chi = 0. & 
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A10.5 Proposition 


Let G be a divisible abelian group. Then its torsion subgroup T is also divisible. Moreover, 
G can be written as T @ D, where D is torsion-free and divisible. 


Proof. If x € T and 0 #4 n € Z, then for some y € G we have ny = x. Thus in the 
torsion-free group G/T we have n(y+7) =x+T = 0. But then ny € T, so (as in 
(A10.1)) y € T and T is divisible, hence injective by (10.6.6). By (10.6.2), the exact 
sequence 0 — T — G— G/T — 0 splits, soG = T@G/T. Since G/T is torsion-free and 
divisible (see (10.6.5)), the result follows. d& 


We are going to show that an abelian group is divisible iff it is a direct sum of copies 
of Q (the additive group of rationals) and quasicyclic groups. To show that every divisible 
abelian group has this form, it suffices, by (A10.2), (A10.5) and the fact that a direct sum 
of divisible abelian groups is divisible, to consider only two cases, G torsion-free and G a 


p-group. 


A10.6 Proposition 


If G is a divisible, torsion-free abelian group, then G is isomorphic to a direct sum of 
copies of Q. 


Proof. The result follows from the observation that G can be regarded as a Q-module, 
that is, a vector space over Q; see Section 10.7, Problem 7. & 


For any abelian group G, let G[n] = {x € G: nx = 0}. 


A10.7 Proposition 


Let G and H be divisible abelian p-groups. Then any isomorphism y of G[p] and H[p] 
can be extended to an isomorphism w of G and H. 


Proof. Our candidate ~ arises from the injectivity of H, as the diagram below indicates. 


ici 


base 


0 = Gp ——=G 


The map from G[p] to G is inclusion, and the map from G[p] to H is the composition of y 
and the inclusion from H[{p] to H. Suppose that « € G and the order of z is |a| = p”. 
We will prove by induction that (x) = 0 implies « = 0. If n = 1, then x € Glpl, 
so w(x) = v(x), and the result follows because y is injective. For the inductive step, 
suppose |z| = p”*! and w(x) = 0. Then |pa| = p” and a(px) = py(x) = 0. By induction 
hypothesis, pr = 0, which contradicts the assumption that x has order p”*!. 

Now we prove by induction that w is surjective. Explicitly, if y © H and |y| = p”, 
then y belongs to the image of w. If n = 1, then y € H[p] and the result follows because y 
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mr 


is surjective. If |y| = p"*+, then p"y € H[p], so for some x € G[p] we have v(x) = p 
Since G is divisible, there exists g € G such that p"g = x. Then 


p’(y — ¥(g)) = vy — W(p"g) = vy — W(x) = p"y — v(x) = 0. 


By induction hypothesis, there is an element z € G such that ¢(z) = y — ~(g). Thus 
Vgtz=y & 


y. 


A10.8 Theorem 


An abelian group G is divisible if and only if G is a direct sum of copies of Q and 
quasicyclic groups. 


Proof. Suppose that G is such a direct sum. Since Q and Z(p~) are divisible [Z(p™) is a 
direct summand of the divisible group Q/Z], and a direct sum of divisible abelian groups 
is divisible, G must be divisible. Conversely, assume G divisible. In view of (A10.6) and 
the discussion preceding it, we may assume that G is a p-group. But then G[p] is a vector 
space over the field F, = Z/pZ; the scalar multiplication is given by (n + pZ)g = ng. 
Since pg = 0, scalar multiplication is well-defined. If the dimension of G[p] over F, is d, 
let H be the direct sum of d copies of Z(p~). An element of order p in a component of the 
direct sum is an integer multiple of 1/p+Z, and consequently H[p] is also a d-dimensional 
vector space over F,. Thus Gp] is isomorphic to H[p], and it follows from (A10.7) that 
G is isomorphic to H. & 


Supplement: The Long Exact 
Homology Sequence and 
Applications 


S1. Chain Complexes 


In the supplement, we will develop some of the building blocks for algebraic topology. 
As we go along, we will make brief comments [in brackets] indicating the connection 
between the algebraic machinery and the topological setting, but for best results here, 
please consult a text or attend lectures on algebraic topology. 


$1.1 Definitions and Comments 


A chain complex (or simply a complex) C, is a family of R-modules C,, n € Z, along 
with R-homomorphisms d,,: C;, — Cy _1 called differentials, satisfying dydn+, = 0 for 
all n. A chain complex with only finitely many C,,’s is allowed; it can always be extended 
with the aid of zero modules and zero maps. [In topology, C,, is the abelian group of n- 
chains, that is, all formal linear combinations with integer coefficients of n-simplices in a 
topological space X. The map d,, is the boundary operator, which assigns to an n-simplex 
an n — 1-chain that represents the oriented boundary of the simplex.] 

The kernel of d, is written Z,,(C,) or just Z,; elements of Z,, are called cycles in 
dimension n. The image of d,+1 is written B,(C,) or just B,; elements of B,, are called 
boundaries in dimension n. Since the composition of two successive differentials is 0, it 
follows that B, C Z,. The quotient Z,,/B, is written H,,(C,) or just H,,; it is called the 
n'” homology module (or homology group if the underlying ring R is Z). 

[The key idea of algebraic topology is the association of an algebraic object, the col- 
lection of homology groups H,,(X), to a topological space X. If two spaces X and Y 
are homeomorphic, in fact if they merely have the same homotopy type, then H,,(X) 
and H,,(Y) are isomorphic for all n. Thus the homology groups can be used to distin- 
guish between topological spaces; if the homology groups differ, the spaces cannot be 
homeomorphic. ] 

Note that any exact sequence is a complex, since the composition of successive maps 
is 0. 


$1.2 Definition 


A chain map f: C, — D, from a chain complex C, to a chain complex D,, is a collection 
of module homomorphisms f,: C, — Dn, such that for all n, the following diagram is 
commutative. 


Ch Dn 
“| | 
Cn-1 — > Dn-1 

fn=1 


We use the same symbol d,, to refer to the differentials in C, and D,. 

(If f: X — Y is a continuous map of topological spaces and ¢ is a singular n-simplex 
in X, then fy(o) = foo isa singular n-simplex in Y, and fy extends to a homomorphism 
of n-chains. If we assemble the fy’s for n = 0,1,..., the result is a chain map.| 


$1.3 Proposition 


A chain map f takes cycles to cycles and boundaries to boundaries. Consequently, the 
map Zn + Bn(Cx) > fr(Zn) + Bn(Dx) is a well-defined homomorphism from H,,(C,) to 
H,,(D,.). It is denoted by H,,(f). 


Proof. If z € Z,(Cx), then since f is a chain map, dn fn(z) = fn—idn(z) = fn—1(0) = 0. 
Therefore f(z) € Zn(D.). If b € By(C.), then dniic = b for some c € Cy4i. Then 
fr(b) = fn(dn410) = dn41fn41C, 80 frn(d) € B,(D,). & 


$1.4 The Homology Functors 


We can create a category whose objects are chain complexes and whose morphisms are 
chain maps. The composition gf of two chain maps f: C, — D, and g: D, — E, is the 
collection of homomorphisms g,, f,, n € Z. For any n, we associate with the chain complex 
C, its n'” homology module H,,(C.), and we associate with the chain map f: C, > D, 
the map H,(f): Hn(Cs) — Hy(D.) defined in (S1.3). Since Hn(gf) = Hn(g)Hn(f) and 
H,,(1c,) is the identity on H,(C,), Hn is a functor, called the n“” homology functor. 


$1.5 Chain Homotopy 


Let f and g be chain maps from C, to D,. We say that f and g are chain homotopic 
and write f ~ g if there exist homomorphisms h,: Cy, — Dy+1 such that fn —- gn = 
dnsilhn + hn_idn; see the diagram below. 


dn 
Ch = Ch— 1 


Rn 
a |44-% 
Rn-1 


Dny+1 ——> D,, 
dn4i 


(If f and g are homotopic maps from a topological space X to a topological space Y, 
then the maps fy and gy (see the discussion in (51.2)) are chain homotopic, ] 


$1.6 Proposition 
If f and g are chain homotopic, then H,(f) = Hn(g). 


Proof. Let z € Z,(C,). Then 
fn(Z) — Gn(z) = (dntithn + hn—1dn)z € Bn(D.) 
since d,z = 0. Thus fy(z) + Br(D.s) = gn(z) + Bn(D.), in other words, H,(f) = 


S2. The Snake Lemma 


We isolate the main ingredient of the long exact homology sequence. After an elaborate 
diagram chase, a homomorphism between two modules is constructed. The domain and 
codomain of the homomorphism are far apart in the diagram, and the arrow joining them 
tends to wiggle like a serpent. First, a result about kernels and cokernels of module 
homomorphisms. 


S2.1 Lemma 


Assume that the diagram below is commutative. 


A= sB 
| | 
Osa 


(i) f induces a homomorphism on kernels, that is, f(ker d) C ker e. 


(ii) g induces a homomorphism on cokernels, that is, the map y +imd — g(y) +ime, 
y € C, is a well-defined homomorphism from coker d to coker e. 

(iii) If f is injective, so is the map induced by f, and if g is surjective, so is the map 
induced by g. 


Proof. (i) If x € A and d(x) = 0, then ef(x) = gd(x) = g0 = 0. 

(ii) If y € imd, then y = dz for some « € A. Thus gy = gdx = efx € ime. Since g is 
a homomorphism, the induced map is also. 

(iii) The first statement holds because the map induced by f is simply a restriction. 
The second statement follows from the form of the map induced by g. & 


Now refer to our snake diagram, Figure $2.1. Initially, we are given only the second 
and third rows (ABEO and OCDF), along with the maps d, e and h. Commutativity of the 
squares ABDC and BEFD is assumed, along with exactness of the rows. The diagram is 
now enlarged as follows. Take A’ =ker d, and let the map from A’ to A be inclusion. Take 
C" = coker d, and let the map from C to C” be canonical. Augment columns 2 and 3 in 
a similar fashion. Let A’ > B’ be the map induced by f on kernels, and let C’ — D’ be 
the map induced by g on cokernels. Similarly, add B’ > E’ and D’ > F’. The enlarged 
diagram is commutative by (S2.1), and it has exact columns by construction. 


= Dae he. 
wae gry 
a— 
= meee 
is A’ > B’ > FE! 
| 
Y Y 
f 
| A B—>E 0 
| 
| d e h 
Y Y 
Y g t 
N 0 C D F 
a 
SS 
Mie: 
a 
Sea Y 


C’ > D' > fF’ 


Figure 52.1 


S2.2 Lemma 


The first and fourth rows of the enlarged snake diagram are exact. 


Proof. This is an instructive diagram chase, showing many standard patterns. Induced 
maps will be denoted by an overbar, and we first prove exactness at B’. If x € A’ = ker 
dand y = fx = fax, then sy = sfx = 0, so y Cker 3. On the other hand, if y € B’ C B 
and sy = sy = 0, then y = fx for some x € A. Thus 0 = ey = efx = gdz, and since g is 
injective, dx = 0. Therefore y = fx with x € A’, and y €im f. 

Now to prove exactness at D’, let x € C. Then t(gz +ime) = tgr +imh = 0 by 
exactness of the third row, so im g Cker f. Conversely, if y € D and t(y+ime) = 
ty +imh = 0, then ty = hz for some z € E. Since s is surjective, z = sx for some x € B. 
Now 


ty = hz =hsx = tex 
so y—ex € kert =img, say y— ex = gu,w € C. Therefore 
y +ime = 9(w +imd) 


andy+imecimg. & 


$2.3. Remark 


Sometimes an even bigger snake diagram is given, with column 1 assumed to be an exact 
sequence 


0 —+> 4’ —> A—*> Cc —+c' —>0 


and similarly for columns 2 and 3. This is nothing new, because by replacing modules 
by isomorphic copies we can assume that A’ is the kernel of d, C’ is the cokernel of d, 
A’ = Ais inclusion, and C — C’ is canonical. 


$2.4 The Connecting Homomorphism 


We will now connect EF’ to C’ in the snake diagram while preserving exactness. The idea 
is to zig-zag through the diagram along the path E’EBDCC". 

Let z € E’ C E; Since s is surjective, there exists y € B such that z = sy. Then 
tey = hsy = hz = 0 since E’ = kerh. Thus ey € kert = img, so ey = gx for some x € C. 
We define the connecting homomorphism 0: E' > C’ by 0z = x +imd. Symbolically, 


O=[g 'oeos 3 
where the brackets indicate that Oz is the coset of x in C’ = C/imd. 

We must show that 0 is well-defined. Suppose that y’ is another element of B with 
sy =z. Then y—y’ € kers =imf, so y—y’ = fu for some u € A. Thus e(y — y’) = 
efu = gdu. Now we know from the above computation that ey = gx for some x € C, and 
similarly ey’ = gx’ for some x’ € C. Therefore g(a — 2’) = gdu, so x — x’ — du € ker g. 
Since g is injective, x — 2’ = du, sox +imd = 2’ +imd. Thus Qz is independent of the 
choice of the representatives y and x. Since every map in the diagram is a homomorphism, 
so is 0. 


$2.5 Snake Lemma 


The sequence 


SI 
wl 
fo) 
| 
a 


A’ B' E’ Cc’ D' F’ 


is exact. 


Proof. In view of (S2.2), we need only show exactness at E’ and C’. If z= sy, ye B’ = 
kere, then ey = 0, so Oz = 0 by definition of 0. Thus ims C ker 0. Conversely, assume 
Oz = 0, and let x and y be as in the definition of 0. Then x = du for some u € A, hence 
gx = gdu = efu. But gx = ey by definition of 0, so y — fu € kere = B’. Since z = sy by 
definition of 0, we have 


z= s(y— fut fu) = s(y— fu) € ims. 


To show exactness at C’, consider an element 0z in the image of 0. Then 0z = x+imd, 
so gOz = gxt+ime. But gx = ey by definition of 0, so gOz = 0 and Oz € ker g. Conversely, 


suppose x € C and G(x +imd) = gz +ime = 0. Then gx = ey for some y € B. If z = sy, 
then hsy = tey = tgx = 0 by exactness of the third row. Thus z € E” and (by definition 
of 0) we have 0z = x + imd. Consequently, «+imd€imd. & 


S3. The Long Exact Homology Sequence 


$3.1 Definition 
We say that 


(=o pe 


E, 0 


where f and g are chain maps, is a short exact sequence of chain complezes if for each n, the 
corresponding sequence formed by the component maps fp: Cy, > Dy and gn: Dn — En, 
is short exact. We will construct connecting homomorphisms O,: Hy( Es) > Hn—1(Cx) 
such that the sequence 


se f 


is exact. [We have taken some liberties with the notation. In the second diagram, f 
stands for the map induced by f, on homology, namely, H,,(f); similarly for g.] The 
second diagram is the long exact homology sequence, and the result may be summarized 
as follows. 


$3.2 Theorem 


A short exact sequence of chain complexes induces a long exact sequence of homology 
modules. 


Proof. This is a double application of the snake lmma. The main ingredient is the 
following snake diagram. 


d d i 


0 —— +> Z,-1(C,) Zn-1(Dx) ————=— n—1( Ex) 


The horizontal maps are derived from the chain maps f and g, and the vertical maps are 
given by d(a, + B,) = dx,. The kernel of a vertical map is {a + Bn: tn € Zr} = Hn, 
and the cokernel is Z,-1/Bn—1 = Hn—1. The diagram is commutative by the definition 
of a chain map. But in order to apply the snake lemma, we must verify that the rows 
are exact, and this involves another application of the snake lemma. The appropriate 
diagram is 


where the horizontal maps are again derived from f and g. The exactness of the rows of 
the first diagram follows from (S2.2) and part (iii) of (S2.1), shifting indices from n to 
n+1lasneeded. & 


$3.3. The connecting homomorphism explicitly 

If z € H,(E,), then z = z,+B,(E,) for some z, € Z,(E,). We apply (S2.4) to compute 
Oz. We have 2, + Bn(Ex) = gn(Yn + Bn(D.)) for some yn € Dn. Then dyn € Zn—1(D.) 
and dyn = fn—1(@n-1) for some %p-1 © Zn—1 (Cx). Finally, 0z = a,_-1 + Byn_1(C,). 
$3.4 Naturality 


Suppose that we have a commutative diagram of short exact sequences of chain complexes, 
as shown below. 


0 >C > D > FE > 0 


a 


0 Cc’ D' E’ 0 


Then there is a corresponding commutative diagram of long exact sequences: 
6a H,(Cx) ——> H,,(D.) —> Hn(Es) es poi.) = Se 
SS Ha(Cl) —> Hn(Di) —> Hn( EL) "> Hna(Cl) —> +» 


Proof. The homology functor, indeed any functor, preserves commutative diagrams, so 
the two squares on the left commute. For the third square, an informal argument may 
help to illuminate the idea. Trace through the explicit construction of 0 in ($3.3), and 
let f be the vertical chain map in the commutative diagram of short exact sequences. The 
first step in the process is 


Ze Balla) Gat BalDe) 
By commutativity, 
fin + Bn( EX) > fm + Bn(D.). 
Continuing in this fashion, we find that if Oz = v,-1 + Bn_1(C.), then 


O( fz) = fen-1 + Bn-1(CL) = f(@n-1 + Bn-1(C«)) = f(0z). & 


A formal proof can be found in “An Introduction to Algebraic Topology” by J. Rotman, 
page 95. 


S4. Projective and Injective Resolutions 


The functors Tor and Ext are developed with the aid of projective and injective resolutions 
of a module, and we will now examine these constructions. 


$4.1 Definitions and Comments 


A left resolution of a module M is an exact sequence 


Py, Pi Po M 0 


A left resolution is a projective resolution if every P; is projective, a free resolution if 
every P; is free. By the first isomorphism theorem, M is isomorphic to the cokernel of the 
map P, — Po, so in a sense no information is lost if M is removed. A deleted projective 
resolution is of the form 


> Py > P; > Po >) 


M 


and the deleted version turns out to be more convenient in computations. Notice that in 
a deleted projective resolution, exactness at Pp no longer holds because the map P; — Po 
need not be surjective. Resolutions with only finitely many P,,’s are allowed, provided 
that the module on the extreme left is 0. The sequence can then be extended via zero 
modules and zero maps. 

Dually, a right resolution of M is an exact sequence 


0 > VM > Eo > By > Ba++; 


we have an injective resolution if every E; is injective, . A deleted injective resolution has 
the form 


0 > Eo > by > Ky Se 


| 


M 


Exactness at Eo no longer holds because the map Ey — FE, need not be injective. 
We will use the notation P, — M for a projective resolution, and M — E, for an 
injective resolution. 


$4.2 Proposition 


Every module M has a free (hence projective) resolution. 


Proof. By (4.3.6), M is a homomorphic image of a free module Fo. Let Ko be the kernel 
of the map from Fo onto M. In turn, there is a homomorphism with kernel kK, from a 
free module F, onto Ko, and we have the following diagram: 


0 iy Fi Ko Fo M 0 


Composing the maps F; — Ko and Ko — Fo, we get 


0 >ky > Fy > Fo > Vf >() 


which is exact. But now we can find a free module F5 and a homomorphism with kernel Ko 
mapping F) onto K,. The above process can be iterated to produce the desired free 
resolution. d& 


Specifying a module by generators and relations (see (4.6.6) for abelian groups) in- 
volves finding an appropriate Fo and Ko, as in the first step of the above iterative process. 
Thus a projective resolution may be regarded as a generalization of a specification by gen- 
erators and relations. 

Injective resolutions can be handled by dualizing the proof of (54.2). 


$4.3 Proposition 
Every module M has an injective resolution. 


Proof. By (10.7.4), M can be embedded in an injective module Eg. Let Co be the cokernel 
of M — Eo, and map Eo canonically onto Cp. Embed Co in an injective module Fy, and 
let C, be the cokernel of the embedding map. We have the following diagram: 


0 > M > Eo >Co > By >C; >() 


Composing Eg — Cop and Co — FE}, we have 


0 > MVM > Eo > By >C; >() 


which is exact. Iterate to produce the desired injective resolution. @& 


S5. Derived Functors 
S5.1 Left Derived Functors 


Suppose that F is a right exact functor from modules to modules. (In general, the 
domain and codomain of F' can be abelian categories, but the example we have in mind is 
M®pR-—.) Given a short exact sequence 0 — A — B — C — 0, we form deleted projective 
resolutions Pa, — A, Pg. — B, Pox — C. It is shown in texts on homological algebra 
that it is possible to define chain maps to produce a short exact sequence of complexes 
as shown below. 


0 >A > B >C >0 
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The functor F' will preserve exactness in the diagram, except at the top row, where we only 
have FA — FB — FC — 0 exact. But remember that we are using deleted resolutions, 
so that the first row is suppressed. The left derived functors of F are defined by taking 
the homology of the complex F'(P), that is, 


(InF)(A) = Hn[F(Pas)]- 


The word “left” is used because the L,,F are computed using left resolutions. It can 
be shown that up to natural equivalence, the derived functors are independent of the 
particular projective resolutions chosen. By (53.2), we have the following long exact 
sequence: 


-- —$ (L,F)(A) —> (Ln F)(B) —> (Ln F)(C) —2> (Ln—1F)(A) —> + 


$5.2 Right Derived Functors 


Suppose now that F is a left exact functor from modules to modules, e.g., Homr(M, —). 
We can dualize the discussion in ($5.1) by reversing the vertical arrows in the commuta- 
tive diagram of complexes, and replacing projective resolutions such as P4, by injective 
resolutions E'4,. The right derived functors of F are defined by taking the homology of 
F(£). In other words, 


(R"P)(A) = A" [F(Eax)] 


where the superscript n indicates that we are using right resolutions and the indices are 
increasing as we move away from the starting point. By (S3.2), we have the following 
long exact sequence: 


-- —$ (R"F)(A) —> (R"F)(B) —> (R"F)(C) —2> (R™1F)(A) —>--- 
$5.3. Lemma 


(LoF)(A) = F(A) = (R°F)(A) 


Proof. This is a good illustration of the advantage of deleted resolutions. If P, — A, we 
have the following diagram: 


F(P,) —> F(Po) —+0 


F(A) 
| 
0 


The kernel of F(Po) — 0 is F(Po), so the 0°" homology module (LoF)(A) is F(Po) mod 
the image of F'(P,) — F(Po) [=the kernel of F(Py)) — F(A).] By the first isomorphism 
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theorem and the right exactness of F', (Lo F')(A) = F(A). To establish the other isomor- 
phism, we switch to injective resolutions and reverse arrows: 


(= Fo) == PED) 


| 


F(A) 


| 


0 


The kernel of F'(E£o) — F'(£1) is isomorphic to F(A) by left exactness of F’, and the image 
of 0 F(£p) is 0. Thus (R°F)(A) © F(A). & 


S5.4 Lemma 


If A is projective, then (L,,F')(A) = 0 for every n > 0; if A is injective, then (R"F)(A) =0 
for every n > 0. 


Proof. If A is projective [resp. injective], then 0 — A — A — 0 is a projective [resp. 
injective] resolution of A. Switching to a deleted resolution, we have 0 > A — 0 in each 
case, and the result follows. & 


S5.5 Definitions and Comments 


If F is the right exact functor M®p —, the left derived functor L,, F is called Tor? (M, —). 
If F is the left exact functor Homr(M,—), the right derived functor R"F is called 
Extp(M, —). It can be shown that the Ext functors can also be computed using pro- 
jective resolutions and the contravariant hom functor. Specifically, 


Ext”,(M, N) = [R"” Homp(—, N)](M). 


A switch from injective to projective resolutions is a simplification, because projective 
resolutions are easier to find in practice. 

The next three results sharpen Lemma $5.4. [The ring R is assumed fixed, and we 
write ®p simply as ®. Similarly, we drop the R in Tor” and Extr. When discussing Tor, 
we assume R commutative.| 


$5.6 Proposition 

If M is an R-module, the following conditions are equivalent. 
(i) M is flat; 

(ii) Tor, (M,N) = 0 for all n > 1 and all modules N; 

(iii) Tor, (17, N) = 0 for all modules N. 
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Proof. (i) implies (ii): Let P, — N be a projective resolution of N. Since M @ — is an 
exact functor (see (10.8.1)), the sequence 


3 MeEP-MeEeR-MeEN-—O0D0 


is exact. Switching to a deleted resolution, we have exactness up to M ® P, but not at 
M ® Po. Since the homology modules derived from an exact sequence are 0, the result 
follows. 

(ii) implies (iii): Take n = 1. 

(iii) implies (i): If 0 — A— B — C — 0isa short exact sequence, then by (S5.1), we 
have the following long exact sequence: 


---Tor;(M, C) > Toro(M, A) — Torg(M, B) — Toro(M,C) - 0. 
By hypothesis, Tor;(M,C) = 0, so by (S5.3), 
0- MEAS MEB—MEAC—0 


is exact, and therefore M is flat. o& 


$5.7 Proposition 


If M is an R-module, the following conditions are equivalent. 


(i) M is projective; 
(ii) Ext"(M, N) = 0 for all n > 1 and all modules N; 
(iii) Ext’(M, N) = 0 for all modules N. 


Proof. (i) implies (ii): By ($5.4) and ($5.5), Ext"(M,N) = [Ext"(—,N)](M) = 0 for 
n> 1. 

(ii) implies (iii): Take n = 1. 

(iii) implies (i): Let 0 —- A — B— M — 0 be a short exact sequence. If N is any 
module, then using projective resolutions and the contravariant hom functor to construct 
Ext, as in ($5.5), we get the following long exact sequence: 


0 > Ext®(M, N) > Ext®(B, N) — Ext°(A, N) > Ext?(M,N) > --- 
By (iii) and (85.3), 
0 — Hom(M, N) — Hom(B, N) — Hom(A, N) — 0 
is exact. Take N = A and let g be the map from A to B. Then the map g* from Hom(B, A) 
to Hom(A, A) is surjective. But 14 € Hom(A, A), so there is a homomorphism f: B — A 


such that g*(f) = fg = 1a. Therefore the sequence 0 > A > B — M — 0 splits, so by 
(10.5.3), M is projective. d& 
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$5.8 Corollary 

If N is an R-module, the following conditions are equivalent. 
(a) N is injective; 

(b) Ext”(M, N) = 0 for all n > 1 and all modules M; 

(c) Ext'(M, N) = 0 for all modules M. 


Proof. Simply saying “duality” may be unconvincing, so let’s give some details. For (a) 
implies (b), we have [Ext”(M,—)|(N) = 0. For (c) implies (a), note that the exact 
sequence 0 + N — A — B-— 0 induces the exact sequence 


0 > Ext?(M, N) > Ext°(M, A) > Ext°(M, B) > 0. 
Replace Ext°(M, N) by Hom(M, N), and similarly for the other terms. Then take M = B 
and proceed exactly as in (S5.7). & 
S6. Some Properties of Ext and Tor 
We will compute Ext?,(A, B) and Tor!(A, B) in several interesting cases. 


S6.1 Example 


We will calculate Ext7(Zm,B) for an arbitrary abelian group B. To ease the notational 
burden slightly, we will omit the subscript Z in Ext and Hom, and use = (most of the 
time) when we really mean ©. We have the following projective resolution of Z,,: 


0 >Z—“">+Z a Digs > 0 


where the m over the arrow indicates multiplication by m. Switching to a deleted resolu- 
tion and applying the contravariant hom functor, we get 


0 ——+ Hom(Z, B) —“+ Hom(Z, B) —~0 
Hom(Z, B) 
But by (9.4.1), we have 
Homa(R, B) = B (1) 
and the above diagram becomes 


m 


0 —> B —> B—>0 (2) 


By (95.3), Ext?(Zm,B) = Hom(Z,,B). Now a homomorphism f from Zm to B is 
determined by f(1), and f(m) = mf(1) = 0. If B(m) is the set of all elements of B 
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that are annihilated by m, that is, B(m) = {x € B: ma = 0}, then the map of B(m) to 
Hom(Zm, B) given by « — f where f(1) = 2, is an isomorphism. Thus 


Ext? (Zm,B) = B(m). 
It follows from (2) that 
Ext” (Z», B) =0, n> 2 
and 
Ext! (Zm,B) = ker(B — 0)/im(B — B) = B/mB. 


The computation for n > 2 is a special case of a more general result. 


$6.2 Proposition 
Ext7(A, B) = 0 for all n > 2 and all abelian groups A and B. 


Proof. If B is embedded in an injective module E, we have the exact sequence 


0+-B3E>E/B-50. 
This is an injective resolution of B since F/B is divisible, hence injective; see (10.6.5) 
and (10.6.6). Applying the functor Hom(A, —) = Homz(A, —) and switching to a deleted 


resolution, we get the sequence 


0 ——~ Hom(A, £) ——> Hom(A, E/B) —~>0 


| 


Hom(A, B) 
whose homology is 0 for alln >2. a 
$6.3 Lemma 
Ext9(Z, B) = Homz(Z, B) = B and Ext}(Z, B) = 0. 


Proof. The first equality follows from ($5.3) and the second from (1) of (86.1). Since Z 
is projective, the last statement follows from (S5.7). & 


S6.4 Example 


We will compute Tor“(Zm, B) for an arbitrary abelian group B. As before, we drop the 
superscript Z and write = for =. We use the same projective resolution of Z, as in (S6.1), 
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and apply the functor — ® B. Since R@r B = B by (8.7.6), we reach diagram (2) as 
before. Thus 


Torn(Zm, B) =0, n > 2; 
Tor1(Zm, B) = ker(B > B) = {x € B: maz = 0} = B(m); 
Toro(Zm, B) = Zp ® B= B/mB. 


[To verify the last equality, use the universal mapping property of the tensor product to 
produce a map of Zm ® B to B/mB such that n ® x > n(a+mB). The inverse of this 
map ist+mB->1®2.| 

The result for n > 2 generalizes as in (S6.2): 


$6.5 Proposition 
Tor?(A, B) = 0 for all n > 2 and all abelian groups A and B. 


Proof. B is the homomorphic image of a free module F. If K is the kernel of the ho- 
momorphism, then the exact sequence 0 Kk — F — B = 0 isa free resolution of B. 
[Kk is a submodule of a free module over a PID, hence is free.] Switching to a deleted 
resolution and applying the tensor functor, we get a four term sequence as in (S6.2), and 
the homology must be 0 forn > 2. d& 


S6.6 Lemma 
Tor, (Z, B) = Tor,(A, Z) = 0; Toro(Z, B) =ZeB=B. 


Proof. The first two equalities follow from (55.6) since Z is flat. The other two equalities 
follow from (S5.3) and (8.7.6). & 


$6.7 Finitely generated abelian groups 


We will show how to compute Ext”(A, B) and Tor,,(A, B) for arbitrary finitely generated 
abelian groups A and B. By (4.6.3), A and B can be expressed as a finite direct sum of 
cyclic groups. Now Tor commutes with direct sums: 


Torn (A, 65-1 By) = ja; Torn (A, Bj). 


[The point is that if P), is a projective resolution of B;, then the direct sum of the P;, 
is a projective resolution of @;B,, by (10.5.4). Since the tensor functor is additive on 
direct sums, by (8.8.6(b)), the Tor functor will be additive as well. Similar results hold 
when the direct sum is in the first coordinate, and when Tor is replaced by Ext. (We use 
(10.6.3) and Problems 5 and 6 of Section 10.9, and note that the direct product and the 
direct sum are isomorphic when there are only finitely many factors.)] 

Thus to complete the computation, we need to know Ext(A, B) and Tor(A, B) when 
A=Zor Z, and B=Z or Z,. We have already done most of the work. By ($6.2) and 
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(S6.5), Ext” and Tor, are identically 0 for n > 2. By (S6.1), 


Ext°(Zm,Z) = Z(m) = {x € Z: ma = 0} =0; 
Ext? (Zim, Zn) = Zn(m) = {x € Zn: me = 0} = Za 


where d is the greatest common divisor of m and n. [For example, Z12(8) = {0,3,6,9} = 
Z4. The point is that the product of two integers is their greatest common divisor times 
their least common multiple.| By (S6.3), 


Ext°(Z, Z) = Hom(Z, Z) = Z; Ext°(Z, Zn) = Zn. 
By (S6.1), 


Ext! (Zm,Z) = Z/mZ = Zim; 
Ext! (Zm;Zn) = Zn/mZn = Za 


as above. By (S5.7), 
Ext'(Z, Z) = Ext’(Z,Zn) = 0 
By (S6.4), 


Tor1(Zm, Z) = Tori(Z, Z,) = Z(m) = 0; 
Tori(Zm, Zn) = Zn(m) = Za. 


By (8.7.6) and (S6.4), 


Toro(Z, Z) = Z; 
Toro(Zm, Z) = Z/mZ = Zn; 
Toro(Zm, Zn) = Zn /MZy = Za. 


Notice that Tor; (A, B) is a torsion group for all finitely generated abelian groups A and B. 
This is a partial explanation of the term “Tor”. The Ext functor arises in the study of 
group extensions. 


S7. Base Change in the Tensor Product 


Let M be an A-module, and suppose that we have a ring homomorphism from A to B (all 
rings are assumed commutative). Then B® 4M becomes a B module (hence an A-module) 
via b(b' ® m) = bb! @ m. This is an example of base change, as discussed in (10.8.8). We 
examine some frequently occurring cases. First, consider B = A/I, where I is an ideal 
of A. 


$7.1 Proposition 
(A/I) @4M & M/IM. 
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Proof. Apply the (right exact) tensor functor to the exact sequence of A-modules 


0-I>+A—A/I 50 
to get the exact sequence 
T®s4M>A®4,M > (A/T) OAM 0. 


Recall from (8.7.6) that A @,4 M is isomorphic to M viaa@®m — am. By the first 
isomorphism theorem, (A/I) ®,4 M is isomorphic to M mod the image of the map from 
I®,aM to M. This image is the collection of all finite sums }> a;m; with a; € I and 
m;,€M,whichisIM. & 


Now consider B = $~!A, where S is a multiplicative subset of A. 


$7.2 Proposition 
(S-1A) @4M2=S-1M. 


Proof. The map from S~!A x M to S~!M given by (a/s,m) — am/s is A-bilinear, so by 
the universal mapping property of the tensor product, there is a linear map a: S~!A@, 
M — S~!M such that a((a/s) @ m) = am/s. The inverse map (3 is given by 3(m/s) = 
(1/s) ®@m. To show that ( is well-defined, suppose that m/s = m'/s’. Then for some 
t € S we have ts’m = tsm’. Thus 


1/s @m=ts'/tss' ®m = 1/tss' @ ts'm = 1/tss' @ tsm’ =1/s' @m’. 


Now a followed by 6 takes a/s®m to am/s and then to 1/s@am = a/s®m. On the other 
hand, @ followed by a takes m/s to 1/s®m and then to m/s. Consequently, a and are 
inverses of each other and yield the desired isomorphism of S~'A@, M and S-'M. & 


Finally, we look at B = A[X]. 
$7.3 Proposition 


[X] 4M © M[X] 


where the elements of M[X] are of the form agmo + a1. Xm, + agX2mgq +-+++anX"Mnp, 
a; € A,m, € M,n=0,1,.... 


Proof. This is very similar to ($7.2). In this case, the map a from A[X]®,4 M to M[X] 
takes f(X)@m to f(X)m, and the map 3 from M[X] to A[X]@4M takes X'm to X'@m. 
Here, there is no need to show that ( is well-defined. & 


Chapter - 1 to 
Chapter -10 


Solutions Chapters 1—5 


Section 1.1 


1. 


Under multiplication, the positive integers form a monoid but not a group, and the 
positive even integers form a semigroup but not a monoid. 


. With |a| denoting the order of a, we have |0| = 1, |1| = 6, |2| = 3, [3] = 2, |4| = 3, 


and |5| = 6. 


. There is a subgroup of order 6/d for each divisor d of 6. We have Zg itself (d = 1), 


{0}(d = 6), {0,2,4}(d = 2), and {0,3}(d = 3). 


. S forms a group under addition. The inverse operation is subtraction, and the zero 


matrix is the additive identity. 


. S* does not form a group under multiplication, since a nonzero matrix whose deter- 


minant is 0 does not have a multiplicative inverse. 


. If dis the smallest positive integer in H, then H consists of all multiples of d. For if 


a € H we have x = qd+r where0<r<d. But then r= 2 -—-—qd€ H, sor must be 
0. 


. Consider the rationals with addition mod 1, in other words identify rational numbers 


that differ by an integer. Thus, for example, 1/3 = 4/3 = 7/3, etc. The group is 
infinite, but every element generates a finite subgroup. For example, the subgroup 
generated by 1/3 is {1/3, 2/3, 0}. 


. (ab)™™ = (a™)” (b")™ = 1, so the order of ab divides mn. Thus |ab| = m,n where 


my, divides m and n, divides n. Consequently, 
QUE DER) ead (1) 


If m = m mz, raise both sides of (1) to the power mz to get 6”! = 1. The order of b, 
namely n, must divide mn , and since m and n are relatively prime, n must divide 
n,. But n, divides n, hence n = n,. Similarly, if nm = n,n2 we raise both sides of (1) 
to the power nz and conclude as above that m = m . But then |ab]| = mini = mn, 
as asserted. 


If c belongs to both (a) and () then since c is a power of a and also a power of b, we 


have c™ = c” = 1. But then the order of c divides both m and n, and since m and n are 
relatively prime, c has order 1, i.e., c= 1. 


9. 


10. 


11. 


12. 


13. 


14. 


15. 


Let |a| = m, |b] = n. If [m,n] is the least common multiple, and (m,n) the greatest 
common divisor, of m and n, then [m,n] = mn/(m,n). Examine the prime factoriza- 
tions of m and n: 


my ty 


Rae 
ino DO we Se 
n= (pi? ---pi*)(piy4" «+ pj") = 8! 8 

where ty < ux for 1 <k <i, andt, > uz fori +1 <k <j. 

Now a” has order m/r and 6° has order n/s, with m/r (= r’) and n/s (= s’) relatively 

prime. By Problem 8, a™b® has order mn/rs = mn/(m,n) = [m,n]. Thus given 

elements of orders m and n, we can construct another element whose order is the 

least common multiple of m and n. Since the least common multiple of m, n and 


q is |[m,n],q], we can inductively find an element whose order is the least common 
multiple of the orders of all elements of G. 


Choose an element a that belongs to H but not kK, and an element b that belongs to 
K but not H, where H and K are subgroups whose union is G. Then ab must belong 
to either H or K, say ab = h € H. But then b = a~'h € H, a contradiction. If 
ab =k € K, then a= kb~! € K, again a contradiction. To prove the last statement, 
note that if H UK is a subgroup, the first result with G replaced by H U K implies 
that H = HUK or K = HUA, in other words, K CH or HC K. 


ak™ — 1 if and only if km is a multiple of n, and the smallest such multiple occurs 


when km is the least common multiple of n and k. Thus the order of a* is [n, k]/k. 

Examination of the prime factorizations of n and k shows that [n, k]/k = n/(n,k). 

We have x € A; iff x is a multiple of p;, and there are exactly n/p; multiples of p; 

between 1 and n. Similarly, x belongs to A; A, iff x is divisible by p;p;, and there 

are exactly Parse multiples of p;p; between 1 and n. The same technique works for all 
éPj 

other terms. 

The set of positive integers in {1,2,...,n} and not relatively prime to n is Ul_, Aj, 

so y(n) = n—| Ut, A;|. By the principle of inclusion and exclusion from basic 

combinatorics, 


Us 
i=l 


By Problem 12, 


=) \Ail-— SoA Al + SO Ai Ay A Ag] — +++ + (=D) AL 42 9+ Ay. 
t=1 


t<Jj i<j<k 


1 1 1 
+S¢ Ss, Se en SLE pa pe 
i<jck PiPjPk 
= = 1 1 eee 1 
Thus y(n) = n(1 eho =) (1 oak 
Let G be cyclic of prime order p. Since the only positive divisors of p are 1 and p, the 
only subgroups of G' are G and {1}. 


No. Any non-identity element of G generates a cyclic subgroup H. If H C G, we 
are finished. If H = G, then G is isomorphic to the integers, and therefore has many 
nontrivial proper subgroups. (See (1.1.4) and Problem 6 above.) 


Section 1.2 


Ts 


10. 


11. 


The cycle decomposition is (1,4)(2,6,5); there is one cycle of even length, so the 
permutation is odd. 


. The elements are J, R = (A,B,C,D), R? = (A,C)(B,D), R? = (A,D,C,B), 


F=(B,D), RF =(A,B)(C,D), R?F =(A,C), R?F = (A,D)(B,C). 


. Such a permutation can be written as (1, a1, @2,a3,@4) where (a1, a2, 43, a4) is a per- 


mutation of {2,3,4,5}. Thus the number of permutations is 4! = 24. 


. Select two symbols from 5, then two symbols from the remaining 3, and divide by 2 


since, for example, (1, 4)(3,5) is the same as (3,5)(1,4). The number of permutations 
is 10(3)/2 = 15. 


. For example, (1,2,3)(1,2) = (1,3) but (1,2)(1,2,3) = (2,3). 
. We have V = {IJ,(1,2)(3,4), (1,3)(2,4), (1,4)(2,3)}. Thus V = {J,a,b,c} where 


the product of any two distinct elements from {a,b,c} (in either order) is the third 
element, and the square of each element is J. It follows that V is an abelian group. 


. This follows because the inverse of the cycle (a1, a2,...,@x) is (@x,...,@2, a1). 


. Pick 3 symbols out of 4 to be moved, then pick one of two possible orientations, e.g., 


(1, 2,3) or (1,3,2). The number of 3-cycles in $4 is therefore 4(2) = 8. 


. If 7 is a 3-cycle, then 7? = IJ, so 74 = 7. But 1+ = (1?)?, and 1? € H by hypothesis, 
y' 


so (1)? € H because H is a group. Thus 7 € H. 

There are 5 inversions, 21, 41, 51, 43 and 53. Thus we have an odd number of 
inversions and the permutation 7 = (1,2, 4)(3,5) is also odd. 

This follows because a transposition of two adjacent symbols in the second row changes 
the number of inversions by exactly 1. Therefore such a transposition changes the 
parity of the number of inversions. Thus the parity of 7 coincides with the parity of the 
number of inversions. In the given example, it takes 5 transpositions of adjacent digits 
to bring 24513 into natural order 12345. It also takes 5 transpositions to create 7: 


m = (1,5)(1,4)(1, 2)(3,5)(3, 4) 


Section 1.3 


1. 


If Ha = Hb then a = la = hb for some h € H, so ab-! = h € H. Conversely, if 
ab-' =h € H then Ha = Hhb = Hb. 


. Reflexivity: aa~'=1€ H. 


Symmetry: If ab~' € H then (ab~!)~! = ba! € H. 
Transitivity: If ab~! € H and bce“! € H then (ab~!)(be~!) = ac7! € H. 


3. ab-! © H iff (ab-!)-! = ba! € Hi iff b € Ha. 
4. Ha~! = Hb-! iff a-*(b-1)-! = ad € HH iff aH = bH. 


5. Since a, belongs to both aH and aH, we have a;H = aH because the left cosets 


partition G. 


. There are only two left cosets of H in G; one is H itself, and the other is, say, aH. 


Similarly, there are only two right cosets, H and Hb. Since the left cosets partition G, 
as do the right cosets, aH must coincide with Hb, so that every left coset if a right 
coset. 


. The permutations on the list are e, (1,2,3), (1,3,2), (1,2), (1,3), and (2,3), which 


are in fact the 6 distinct permutations of {1,2,3}. 


. The left cosets of H are H = {e,b}, aH = {a,ab}, and a?H = {a?,a*b}. The right 


cosets of H are H = {e,b}, Ha = {a, ba} = {a,a7b}, and Ha? = {a?, ba?} = {a?, ab}. 


. The computation of Problem 8 shows that the left cosets of H do not coincide with 


the right cosets. Explicitly, aH and a?H are not right cosets (and similarly, Ha and 
Ha? are not left cosets). 


.fm=fdstls+-- D=fa)+fO)+---f)=r4+r4+---r=rn. 
. In Problem 10, the image f(Z) must coincide with Z. But f(Z) consists of all multiples 


of r, and the only way f(Z) can equal Z is for r to be +1. 


. The automorphism group of Z is {1,—I} where (—I)? = J. Thus the automorphisms 


of Z form acyclic group of order 2. (There is only one such group, up to isomorphism.) 


. Reflexivity: x = lvl. Symmetry: If « = hyk, then y = h~'xk~!. Transitivity: if 


hyyky and Yu hoazka, then x = hyhozkoky. 


. Hak is the union over all k € K of the right cosets H(xk), and also the union over 


all h € H of the left cosets (ha) K. 


Section 1.4 


1. 


. f(xy) = arya- 


Define f: Z— Z, by f(a) =the residue class of x mod n. Then f is an epimorphism 
with kernel nZ, and the result follows from the first isomorphism theorem. 


. Define f: Zn —- Zpjm by f(z) = « mod n/m. Then f is an epimorphism with 


kernel Z,,, and the result follows from the first isomorphism theorem. (In the concrete 
example with n = 12, m = 4, we have f(0) = 0, f(1) = 1, f(2) = 2, f(3) =0, f(4) = 1, 
f(5) = 2, f(6) = 0, etc.) 

1 = axa ‘aya! = f(x)f(y), so f is a homomorphism. If b € G, we 
can solve axa! = b for x, namely x = a~'ba, so f is surjective. If ara~t = 1 then 
ax =a, so x =1 and f is injective. Thus f is an automorphism. 


1 


. Note that fas(z) = aba(ab)—' = a(brb-1)a-1 = fa(fo(x)), and y = fa(x) iff e = 


fa-1(y), so that (fa)7! = fy-1. 


. Define UV: G — Inn G, the group of inner automorphisms of G, by U(a) = f,. Then 


W(ab) = far = fa ° fo = VU(a)V(b), so V is a homomorphism (see the solution to 
Problem 4). Since a is arbitrary, V is surjective. Now a belongs to ker W iff f, is the 
identity function, i.e., ava~! = x for all x € G, in other words, a commutes with every 
xin G. Thus ker UV = Z(G), and the result follows from the first isomorphism theorem. 


. If f is an automorphism of Z,, then since 1 generates Z,, f is completely determined by 


m = f(1), and since 1 has order n in Z,, m must have order n as well. But then m is a 
unit mod n (see (1.1.5)), and f(r) = f14+14+---l =fQ)+f()+---f) =rfd) = 


. If g(x) = y, then go fa og7 
. If G is abelian, then f,() = ava~! = aa~!xz = 2. 


rm. Conversely, any unit m mod n determines an automorphism 6(m) = multiplication 
by m. The correspondence between m and @(m) is a group isomorphism because 
O(m m2) = O(m1) o O(mz2). 


. The first assertion follows from the observation that HN is the subgroup generated 


by HUN (see (1.3.6)). For the second assertion, note that if K is a subgroup of G 
contained in both H and N, then K is contained in HNN. 


' maps y to g(axa~') = g(a)ylg(a)] 


-1 


Section 1.5 


1. 


C2 x Cz has 4 elements 1 = (1,1), a = (a,1), 6 = (1,a) and y = (a,a), and the 
product of any two distinct elements from {a, 3,y} is the third. Since each of a, {, 
+ has order 2 (and 1 has order 1), there is no element of order 4 and C2 x C is not 
cyclic. 


. The four group is V = {I, a,b,c} where the product of any two distinct elements from 


{a, b,c} is the third. Therefore, the correspondence 1 > I, a a, B > b, y > cis an 
isomorphism of Cy x Co and V. 


. Let Cp = {1,a} with a? = 1, and C3 = {1,6,b?} with b? = 1. Then (a,b) generates 


Cz x C3, since the successive powers of this element are (a,b), (1,07), (a,1), (1,0), 
(a, b”), and (1,1). Therefore C2 x C3 is cyclic of order 6, i.e., isomorphic to C¢. 


. Proceed as in Problem 3. If a has order n in C, and b has order m in Cin, then (a, b) 


has order nm in Ci, X Cyn, so that Cy, x Cm is cyclic of order nm. 


. Suppose that (a,b) is a generator of the cyclic group C, x C;,. Then a must generate 


C,, and b must generate C’,, (recall that C, x {1} can be identified with C,). But 
(a,b)* = 1 iff a* = b* = 1, and it follows that the order of (a,b) is the least common 
multiple of the orders of a and 8, i.e., the least common multiple of n and m. Since n 
and m are not relatively prime, the least common multiple is strictly smaller than nm, 
so that (a,b) cannot possibly generate C), x Cy, a contradiction. 


. By (1.3.3), G and H are both cyclic. Since p and gq are distinct primes, they are 


relatively prime, and by Problem 4, G x H is cyclic. 


. Define f: H x K — K x H by f(h,k) = (k,h). It follows from the definition of direct 


product that f is an isomorphism. 


. Define fi: Gx Hx K —Gx (Hx K) by fi(g,h, k) = (g, (h, k)), and define fo: G x 


Hx K — (Gx H) x K by fo(g,h, k) = ((g,h),k). It follows from the definition of 
direct product that f; and fo are isomorphisms. 


Section 2.1 


1. Never. If f is a polynomial whose degree is at least 1, then f cannot have an inverse. 


For if f(X)g(X) = 1, then the leading coefficient of g would have to be 0, which is 
impossible. 


10. 


11. 


. If f(X)g(X) = 1, then (see Problem 1) f and g are polynomials of degree 0, in other 


words, elements of R. Thus the units of R[X] are simply the nonzero elements of R. 


. (a) No element of the form a;X + a2X? +--+ can have an inverse. 


(b) For example, 1 — X is a unit because (1— X)(1+ X4+ X74 X3+4+---)=1. 


. Since Z|?] is a subset of the field C of complex numbers, there can be no zero divisors 


in Z[2]. If w is a nonzero Gaussian integer, then w has an inverse in C, but the inverse 
11 


need not belong to Z[i]. For example, (1+ i)~! = 5 — $i. 


. If z= a+ bi with a and 6 integers, then |z|? = a? + 67, so that if z is not zero, we 
must have |z| > 1. Thus if zw = 1, so that |z||w| = 1, we have |z| = 1, and the only 
possibilities are a = 0,b = +1 or a= +1,b = 0. Consequently, the units of Z[i] are 1, 
—1,iand -1. 


. All identities follow directly from the definition of multiplication of quaternions. Al- 


ternatively, (b) can be deduced from (a) by interchanging x; and x2, y and yo, 21 
and z2, and w; and we. Then the second identity of (c) can be deduced by noting 
that the quaternion on the right side of the equals sign in (a) is the conjugate of the 
quaternion on the right side of the equals sign in (b). 


. Multiply identities (a) and (b), and use (c). (This is not how Euler discovered the 


identity; quaternions were not invented until much later.) 


. The verification that End G is an abelian group under addition uses the fact that G is 


an abelian group. The additive identity is the zero function, and the additive inverse 
of f is given by (—f)(a) = —f(a). Multiplication is associative because composition 
of functions is associative. To establish the distributive laws, note that the value of 
(f + g)h at the element a € G is f(h(a)) + g(h(a)), so that (f+ g)h = fh gh. 
Furthermore, the value of f(g +h) at ais f(g(a) + h(a)) = f(g(a)) + f(A(a)) since 
f is an endomorphism. Therefore f(g +h) = fh+ gh. The multiplicative identity is 
the identity function, given by E(a) = a for all a. 


. An endomorphism that has an inverse must be an isomorphism of G with itself. Thus 


the units of the ring End G are the automorphisms of G. 

Use Euler’s identity with x; ly = 2,2, = 2,w, = 5 (34 = 174 2? 4 2? 4 5?) 
and x2 = 1,y2 = 1,22 = 4,we = 6 (54 = 17+ 12 + 47 + 62). The result is 1836 = 
(34)(54) = 412 + 9? + 5? + 72. The decomposition is not unique; another possibility 
is Ly 0,41 0, 21 3,W1 5, £2 0, ya 1, 22 2, We 7. 


In all four cases, sums and products of matrices of the given type are also of that type. 
But in (b), there is no matrix of the given form that can serve as the multiplicative 
identity.. Thus the sets (a), (c) and (d) are rings, but (b) is not. 


Section 2.2 


1. 


By Section 1.1, Problem 6, the additive subgroups of Z are of the form (n) = all 
multiples of n. But if # € (n) and r € Z then ra € (n), so each (n) is an ideal as well. 


. If the n by n matrix A is 0 except perhaps in column k, and B is any n by n matrix, 


then BA is 0 except perhaps in column k. Similarly, if A is 0 off row k, then so is AB. 


3. (a) This follows from the definition of matrix multiplication. 
(b) In (a) we have aj, = 0 for r #k, and the result follows. 


(c) By (b), the i” term of the sum is a matrix with cj, in the ik position, and 0’s 
elsewhere. The sum therefore coincides with C. 


4. The statement about left ideals follows from the formula of Problem 3(c). The result 
for right ideals is proved in a similar fashion. Explicitly, AE; has column i of A as its 
j'” column, with 0’s elsewhere. If A € R, then AE;; has a,j; in the kj position, with 
0’s elsewhere, so if a4; 40 we have AE;;a;,; = Ej. Thus if C € R, then 


a AE ij 0; Ckj =C. 


j=1 


5. If I is a two-sided ideal and A € I with a,, 4 0, then by considering products of the 
form a;,! EpqAEx (which have the effect of selecting an entry of A and sliding it from 
one row or column to another), we can show that every matrix E;; belongs to I. Since 
every matrix is a linear combination of the E;,, it follows that J = M,,(R). 


6. A polynomial with no constant term is of the form a,X + a2X?+---a,X" = Xg(X). 
Conversely, a polynomial expressible as Xg(X) has no constant term. Thus we may 
take f= X. 

7. Let a be a nonzero element of R. Then the principal ideal (a) is not {0}, so (a) = R. 
Thus 1 € (a), so there is an element b € R such that ab = 1. 


8. Since an ideal J is a finite set in this case, it must have a finite set of generators 
%1,..-.,2,. Let d be the greatest common divisor of the x;.. Every element of I is 
of the form a,x”, +--+ + apx%, and hence is a multiple of d). Thus J C (d). But 
d € I, because there are integers a; such that )>,a;7; = d. Consequently, (d) C I. 
(Technically, arithmetic is modulo n, but we get around this difficulty by noting that 
if ab = c as integers, then ab = c modulo n.| 


Section 2.3 


1. Use the same maps as before, and apply the first isomorphism theorem for rings. 

2. If I, is the set of multiples of n > 1 in the ring of integers, then J, is an ideal but not 
a subring (since 1 ¢ I,). Z is a subring of the rational numbers Q but not an ideal, 
since a rational number times an integer need not be an integer. 

3. In parts (2) and (3) of the Chinese remainder theorem, take R = Z and I; = the set of 
multiples of m,. 

4. Apply part (4) of the Chinese remainder theorem with R = Z and I; = the set of 
multiples of m;. 

5. To prove the first statement, define f: R — Ry by f(r1,r2) = ro. Then f is a ring 
homomorphism with kernel R{ and image Rz. By the first isomorphism theorem for 


rings, R/R{ & Ro. A symmetrical argument proves the second statement. In practice, 
we tend to forget about the primes and write R/R, ~ Rg and R/R2 = R,. There is 


also a tendency to identify a ring with its isomorphic copy, and write R/R, = R2 and 
R/R2 = R, This should not cause any difficulty if you add, mentally at least, ” up to 
isomorphism” . 


. The product is always a subset of the intersection, by definition. First consider the 


case of two ideals. Then 1 = a, + a2 for some a, € ,ag € ky. Tf DE WANN, 
then b = b1 = ba, + bag € I;Ig. The case of more than two ideals is handled by 
induction. Note that R = (11 + In)Ul2 + In)-++Un-1 + In) © Ch-++In-1) + In. 
Therefore (1, ---In—1) + In = R. By the n = 2 case and the induction hypothesis, 
Tye In-tIn =  In-1) NIn = LO 1g N+ +N Tn. 


. Let a+Oj,J; map to (1+ 4,0 + Io,c3 + Iz,...,¢€n + In), where the c; are arbitrary. 


Then 1—aeé , anda€é Ih, sol =(l-—a)+ae€h4y. Thus , + Ip = R, and 
similarly [; + J; = R for all i Fj. 


Section 2.4 


1. 


Ifn=rswithr,s >1thenr ¢ (n),s ¢ (n), but rs € (n), so that (n) is not prime. But 
Z/(n) is isomorphic to Zp, the ring of integers modulo n (see Section 2.3, Problem 1). 
If n is prime, then Z,, is a field, in particular an integral domain, hence (n) is a prime 
ideal by (2.4.5). 


. By Problem 1, I is of the form (p) where p is prime. Since Z/(p) is isomorphic to Zp, 


which is a field, (p) is maximal by (2.4.3). 


. The epimorphism ap + a1 X + a2X?* +--+ — ao of F[[X]] onto F has kernel (X), and 


the result follows from (2.4.7). 


. The ideal J = (2, X) is not proper; since 2 € J and 3 € F C F[[X]], we have 1 € I and 


therefore [ = F|[X]]. The key point is that F is a field, whereas Z is not. 


. Suppose that f(X) = a9 + a,X +--- belongs to J but not to (X). Then ap cannot 


be 0, so by ordinary long division we can find g(X) € F[[X]] such that f(X)g(X) = 1. 
But then 1 € J, contradicting the assumption that J is proper. 


. Let f(X) = a, X” + ang 1 X"*1 +--+ jan #0, be an element of the ideal J, with n 


as small as possible. Then f(X) € (X”), and if g(X) is any element of I, we have 
g(X) € (X™) for some m > n. Thus I C (X"). Conversely, if f(X) = X"g(X) € I, 
with g(X) = an + Gn4i1X +--+ jan 4 0,, then as in Problem 5, g(X) is a unit, and 
therefore X” € I. Thus (X”) C J, so that I = (X”), as claimed. 


. f~*(P) is an additive subgroup by (1.3.15), part (ii). If a € f~1(P) and r € R, then 


f(ra) = f(r) f(@) € P, so ra € f-1(P). Thus f~'(P) is an ideal. If ab € f~!(P), then 
f(a)f(b) € P, so either a or b must belong to f~!(P). If f-'(P) = R, then f maps 
eveything in R, including 1, into P; thus f~'(P) is proper. (Another method: As a 
proper ideal, P is the kernel of some ring homomorphism 7. Consequently, f~1(P) is 
the kernel of 70 f, which is also a ring homomorphism. Therefore f~!(P) is a proper 
ideal.) Consequently, f~!(P) is prime. 


. Let S be a field, and R an integral domain contained in S, and assume that R is not 


a field. For example, let R= Z, S=Q. Take f to be the inclusion map. Then {0} is 
a maximal ideal of S, but f~'({0}) = {0} is a prime but not maximal ideal of R. 


9. 


If P= INJ with PC ITand P C J, choose a € I\P andb € J\P. Then ab € INJ = P, 
contradicting the assumption that P is prime. 


Section 2.5 


1. 


Any number that divides a and b divides 6 and r;, and conversely, any number that 
divides b and r; divides a and b. Iterating this argument, we find that gcd(a,b) = 
gcd(b, r1) = ged(r1,r2) = +++ = ged(rj_1, 73) = 75. 


. This follows by successive substitution. We start with r; = rj;-2 — rj-1q;, continue 


with Tj-1 = 15-3 —T5j-29j-1, Tj-2 = 17-4 — Tj-3Gj-2, and proceed up the ladder until 
we have expressed d as a linear combination of a and b. There is an easier way, as 
Problem 3 shows. 


. The first equation of the three describes the steps of the algorithm. We wish to prove 


that ax; + by; = r;, that is, 
a(xj—2 — GXi-1) + O(Yi-2 — GYi-1) = Ni- (1) 


But this follows by induction: if ax;_2 + by;-2 = rj_2g and ax;_1 + byj_-1 = ri-1, then 
the left side of (1) is r;-2—qiri_1, which is r; by definition of the Euclidean algorithm. 


. We have the following table: 
a Gi+1 r% x4 Yi 
—1 — 123 1 0 
0 2 54 0 1 
1 3 15 1 —2 
2 1 9 —3 7 
3 1 6 4 -9 
4 2 3 —7 16 
For example, to go from i = 1 to 7 = 2 we have x2 = 2 — qgu, = 0 — 3(1) = —3, 


y2 = Yo — @y1 = 1 — 3(—-2) = 7, and rg = ro — qari = 54 — 3(15) = 9; also, 
gg = |15/9| = 1. We have axe + byg = 123(—3) + 54(7) = 9 = re, as expected. The 
process terminates with 123(—7) + 54(16) =3 =d. 


. If p is composite, say p = rs with 1 <r <p,1<s <p, then rs is 0 in Z, but r 


and s are nonzero, so Z, is not a field. If p is prime and a is not zero in Z, then the 
greatest common divisor of a and p is 1, and consequently there are integers x and 
y such that ax + py = 1. In Z, this becomes ax = 1, so that every nonzero element 
in Z, has an inverse in Z,, proving that Z, is a field. 


. Since f(X) and g(X) are multiples of d(X), so are all linear combinations a(X) f(X)+ 


b(X)g(X), and consequently I C J. By Problem 2, there are polynomials a(X) and 
b(X) such that a(X) f(X) + b(X)g(X) = d(X), so that d(X) belongs to I. Since I is 
an ideal, every multiple of d(X) belongs to I, and therefore J C I. 
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. If g(X) is another polynomial such that g(a;) = f(a;) for all i, then f and g agree at 


n+1 points, so that f(X)—g(X) has more than n roots in F. By (2.5.3), f(X)—g(X) 
must be the zero polynomial. 


. If F has only finitely many elements a1,...,@,, take f(X) = (X — a,)---(X — ay). 
10. 


Let F' be the complex numbers C. Then every polynomial of degree n has exactly n 
roots, counting multiplicity. Thus if f(a) = 0 at more than n points a, in particular 
if f vanishes at every point of C, then f = 0. More generally, F can be any infinite 
field (use (2.5.3)). 


Section 2.6 


. If r= 0 then J contains a unit, so that 1 € J and J= R. 
. If b ¢ (pi) then b+ (p,) ¥ (pi), so 6+ (pi) has an inverse in R/(p;), say c+ (pi). Thus 


(b+ (pi))(e + (pi)) = 1+ (pi), hence (bc — 1) + (p1) = (pi), so be — 1 € (pi). 


. If be — dp, = 1 then bep2--- py — dpi +++ Pn = p2-++Pn, and since b and p, ---p, belong 


to I, so does p2+-- Pn, contradicting the minimality of n. (If n = 1, then 1 € J, so 
I=R.) 


. Ifa,b€ Rand z,y € J then (ax+by)p; = xpia+ypib. Since x,y € J we have xp; € I 


and yp; € I, so that (ax + by)pi € I, hence ax + by € J. 


_ Ifa € J then xp, € I, so Jp, C I. Now I C (p,) by Problem 3, so if a € I then 


a= xp, for some x € R. But then x € J by definition of J, soa = xp, € Jpy. 


. Since J contains a product of fewer than n primes, J is principal by the induction 


hypothesis. If J = (d) then by Problem 5, J = J(p1). But then I = (dp,), and the 
result follows. (If n = 1, then p; € J, hence 1 € J , so J= Rand I = J(p,) = (p.).) 


. Assume that P C Q. Then p = aq for some a € R, so aq € P. Since P is prime, either 


a or q belongs to P. In the second case, @ C P and we are finished. Thus assume 
a € P, so that a = bp for some b € R. Then p = aq = bpgq, and since R is an integral 
domain and p # 0, we have bq = 1, so q is a unit and Q = R, a contradiction of the 
assumption that Q is prime. 


. Let x be a nonzero element of P, with x = up, --- pn, ua unit and the p; prime. Then 


D1-**Dn = uta € P, and since P is prime, some p; belongs to P. Thus P contains 
the nonzero principal prime ideal (p;). 


Section 2.7 


1. 


If m is a generator of the indicated ideal then m belongs to all (a;), so each a; divides 
m. If each a; divides b then 6 is in every (a;), so b € Ne, (a;) = (m), so m divides b. 
Thus m is a least common multiple of A. Now suppose that m is an lcm of A, and let 
NP_,(a;) = (c). Then c belongs to every (a;), so each a; divides c. Since m = lcm(A), 
m divides c, so (c) is a subset of (m). But again since m = lem(A), each a; divides m, 
som € MP, (ai) = (c). Therefore (m) C (c), hence (m) = (c), and m is a generator 
of (aan (ai) . 
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. Let a = 114 3, b = 8-7. Then a/b = (11 + 37)(8 + 2) /65 = 85/65 + 135/65. Thus 
we may take rq = yo = 1, and the first quotient is qi = 1+7. The first remainder 
is ry = a — bq, = (114+ 32) — (8 —t)(1 +7) = 2 — 4%. The next step in the Euclidean 
algorithm is (8—7)/(2—47) = (8 —7)(2+ 42) /20 = 1+ (31/2). Thus the second quotient 
is gg = 1 +7 (q2 = 1+ 2% would be equally good). The second remainder is rg = 
(8—%) —(2—4%)(1+2) =2+i. The next step is (2—4i) /(2+4) = (2—4i) (2-1) /5 = —24, 
So q3 = —2t, rz = 0. The gcd is the last divisor, namely 2 + 7. 

. We have U(1) < W(1(a)) = U(a) for every nonzero a. If a is a unit with ab = 1, 
then U(a) < U(ab) = U1), so U(a) = U(1). Conversely, suppose that a # 0 and 
W(a) = W(1). Divide 1 by a to get 1 = ag+r, where r =0 or U(r) < U(a) = V(1). 
But if r 40 then V(r) must be greater than or equal to U(1), so we must have r = 0. 
Therefore 1 = aq, and a is a unit. 


. U((a, + bi Vd) (a2 + b2Vd)) 

= wW(a1a2 + bybed 4 (ayb2 t azb,) Vd) 

= |aya2 + by bod + (ayb2 + azb,) Vd |ay az + by bod — (ayb2 + azb,) Vd; 
Wa, + bi, Vd)V (az + bo Vd) 

= lax a bi Vd| |a2 a boVd| lay = bi Vv d| |az = boVd| 
and it follows that U(aB) = VU(a)¥(3). Now W(a) > 1 for all nonzero a, for if 
W(a) = |a? — db?| = 0, then a? = db?. But if b 4 0 then d = (a/b)?, contradicting the 
assumption that d is not a perfect square. Thus b = 0, so a is 0 as well, anda=0,a 
contradiction. Thus U(aZ) = U(a)(B) > V(a). 
. Either d or d—1 is even, so 2 divides d(d — 1) = d? — d = (d+ Vd)(d— Vd). But 2 
does not divide d+ Vd or d— Vd. For example, if 2(a+ b/d) =d+t Vd) for integers 
a,b then 2a — d = (1 — 2b) Vd, which is impossible since Vd is irrational. (If Vd = r/s 
then r? = ds?, which cannot happen if d is not a perfect square.) 


. Define W as in Problem 4 (and Example (2.7.5)). Suppose 2 = a@ where a and (3 are 
nonunits in Z[Vd]. Then 4 = (2) = U(a)¥(@), with U(a), (3) > 1 by Problems 3 
and 4. But then U(a) = (3) = 2. If a=a+bvd then |a? — db?| = 2, so a? — db? is 
either 2 or —2. Therefore if b 4 0 (so that b? > 1), then since d < —3 we have 


a? — db? > 0+3(1) =3, 


a contradiction. Thus b = 0, so a = a, and 2 = W(a) = a’, an impossibility for a € Z. 
. This follows from Problems 5 and 6, along with (2.6.4). 


. Just as with ordinary integers, the product of two Gaussian integers is their greatest 
common divisor times their least common multiple. Thus by Problem 2, the lcm is 
(11 + 37)(8 — t)/(2 +7) = 39 — 132. 

. Ifa = Py, then V(a) = U(G)U(y). By hypothesis, either U(3) or U(y) is 1(= W(1)). 
By Problem 3, either § or ¥ is a unit. 


Section 2.8 


1. If D is a field, then the quotient field F’, which can be viewed as the smallest field 


containing D, is D itself. Strictly speaking, F' is isomorphic to D; the embedding map 
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a ce ac € 
a + (5 + <) and (5 + <) + — both compute to be 


f(a) = a/1 is surjective, hence an isomorphism. To see this, note that if a/b € F, then 
a/b=ab-'/1= f(ab-?). 


. The quotient field consists of all rational functions f(X)/g(X), where f(X) and g(X) 


are polynomials in F[X] and g(X) is not the zero polynomial. To see this, note that 
the collection of rational functions is in fact a field, and any field containing F'-X] must 
contain all such rational functions. 

adf + bcf + bde 


ay Ff bdf 


. *(5+5)-3(4*)--<4* aad 


b\d f b df bdf 
ac ae _acbf+bdae _ acf+dae _ acf +ade 
bd ° bf b?df — bdf  —————sidff 


. If g is any extension of h and a/b € F, there is only one possible choice for g(a/b), 


namely h(a)/h(b). (Since b 4 0 and h is a monomorphism, h(b) 4 0.) If we define g 
this way, then g(a) = g(a/1) = h(a)/h(1) = h(a), so g is in fact an extension of f. 
Furthermore, if a/b = c/d then since h is a monomorphism, h(a)/h(b) = h(c)/h(d). 
Therefore g is well-defined. Again since h is a monomorphism, it follows that 9(¢+5) = 
9(¢) +9($) and g(¢5) = g(#)9(S). Since g is an extension of h, we have g(1) = 1, 
so g is a homomorphism. Finally, if g(a/b) = 0, then h(a) = 0, so a = 0 by injectivity 
of h. Thus g is a monomorphism. 


. The problem is that h is not injective. As before, if g is to be an extension of h, we 


must have g(a/b) = h(a)/h(b). But if 6 is a multiple of p, then h(b) is zero, so no 
such g can exist. 


We must have 9(a/b) = 9(a/1)9((b/1)") = 9(a)9(0)-. 
. Ifa/b = c/d, then for some s€ S' we have s(ad—bc) = 0. So g(s)[g(a)g(d)—g(b)g(c)] = 0. 


Since g(s) is a unit, we may multiply by its inverse to get g(a)g(d) = g(b)g(c), hence 
g(a)g(b)~* = g(c)g(d)~*, proving that 7 is well-defined. To show that 9 is a homomor- 
phism, we compute 


2 Cae ad+be\ | = 
g (¢ + =) =9 (“*) = glad + be)g(bd) 


= [g(a)9(d) + g(d)g(c)]g(0)-*9(a)* =9(F) +9 (4) 


Similarly, we have 9(#$) = 9(4)9($) and g(1) =1. 


Section 2.9 


1. 


We have an(u/v)” + dn—1(u/v)"-1 +--+ + a1(u/v) + ao = 0; multiply by v” to get 
Anu” + dn_pu" ty +--+ + a,uv™! + agv™ = 0. 
Therefore 


Anu” = —an_yu" ty — +++ — ayuv" + = agv”. 
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Since v divides the right side of this equation, it must divide the left side as well, and 
since u and v are relatively prime, v must divide a,. Similarly, 


agu™ = —anu” — an_yu™1v —--»— ayuv™}, 


so u divides ao. 

2. X” — p satisfies Eisenstein’s criterion, and since the polynomial is primitive, it is 
irreducible over Z. 

3. f3(X) = X3+2X +1, which is irreducible over Z3. For if f3(X) were reducible over Zs, 
it would have a linear factor (since it is a cubic), necessarily X — 1 or X + 1(= X — 2). 
But then 1 or 2 would be a root of fs, a contradiction since f3(1) = 1 and f3(2) = 1 
(mod 3). 


4, By Eisenstein, X* + 3 is irreducible over Z. The substitution X = Y +1 yields 
Y*4 4Y? + 6Y? +4Y +4, which is therefore irreducible in Z[Y]. Thus X* + 4X3 + 
6X? +4X + 4 is irreducible in Z[X], i-e., irreducible over Z. 


5. Note that (n,X) is a proper ideal since it cannot contain 1. If (n,X) = (f) then 
n € (f), so n is a multiple of f. Thus f is constant (4 1), in which case X ¢ (f). 

6. Since 1 ¢ (X,Y), (X,Y) is a proper ideal. Suppose (X,Y) = (f). Then Y is a 
multiple of f, so f is a polynomial in Y alone (in fact f = cY). But then X ¢ (f),a 
contradiction. 


7. If p= X +17, then p is irreducible since X +7 is of degree 1. Furthermore, p divides 
X? +1 but p? does not. Take the ring R to be C[X,Y] = (C[X])[Y] and apply 
Eisenstein’s criterion. 

8. Write f(X,Y) as Y3+(X3+4+1) and take p = X +1. Since X?+1 = (X+1)(X?-X+1) 
and X +1 does not divide X? — X + 1, the result follows as in Problem 7. 


Section 3.1 


1. F(S) consists of all quotients of finite linear combinations (with coefficients in F’) of 
finite products of elements of S. To prove this, note first that the set A of all such 
quotients is a field. Then observe that any field containing F and S must contain 
A, in particular, A C F(S). But F'(S) is the smallest subfield containing F and S, 
so F(S) C A. 

2. The composite consists of all quotients of finite sums of products of the form x;, 24, --- 
Li,» M = 1,2,..., where 71, %2,...,%. € I and 2;, € K;;. As in Problem 1, the set A 
of all such quotients is a field, and any field that contains all the K; must contain A. 

3. By (3.1.9), [Fla] : F] = [Fla] : F[G]][F [G6] : F], and since the degree of any extension 
is at least 1, the result follows. 

4. Let min(—1+4+ V2, Q) = aj +a,X + X? (a polynomial of degree 1 cannot work because 
—1+ V2 ¢ Q). Then ao + ai(—1 + V2) + (-1 + V2)? = 0. Since (-1+ V2)? = 
3 — 2,/2, we have ap — a1 + 3 = 0 and a, — 2 = 0, so ag = —1,a, = 2. Therefore 
min(—1 + 2/2,Q) = X?4+2X -1. 
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10. 


. Let B= bo +d,a+-+:+bn_1a"—!. Then for some ag,...,@n € F we have aj + a1 8 + 


--++a,0" = 0. Substituting the expression for @ in terms of a into this equation, 
reducing to a polynomial in a of degree at most n— 1 (as in the proof of (3.1.7)), and 
setting the coefficients of the a’, i= 0,1,...,n—1 equal to zero (remember that the 
a’ form a basis for F[a] over F’), we get n linear equations in the n+ 1 unknowns aj, 
i=0,...,n. We know that a solution exists because (3 is algebraic over F’. By brute 
force (try aj = 1, aj = 0, 7 > 2 for i = 1,2,...,n), we will eventually arrive at the 
minimal polynomial. 


. Define y: F(X) — E by v(f(X)/9(X)) = f(a)/g(a). Note that y is well-defined, 


since if g is a nonzero polynomial, then g(a) 4 0 (because a is transcendental over F’). 
By (3.1.2), y is a monomorphism. Since y(F'(X)) = F(a), it follows that F(X) and 
F(a) are isomorphic. 


. The kernel of y is J, and as in (3.1.3), F[X]/TZ is a field. The image of is Fa], and 


by the first isomorphism theorem for rings, F'[a] is isomorphic to F[X]/I. Therefore 
Fa] is a field, and consequently F'[a] = F(a). 


. If f = gh, then (g + 1)(h +1) =0 in F[X]/I, so F[X]/T is not a field. By (2.4.3), I 


is not maximal. 


. The minimal polynomial over F' belongs to F[X] C E[X], and has a as a root. Thus 


min(a, £) divides min(a, F’). 

The result is true for n = 1; see (3.1.7). Let BE = Flaj,...,@,-1], so that 
[Flay,...,Qn]: Fl] = [Flai,...,Qn]: E|[E: F) = [Elan]: EE: F]. But [Elan] : E] 
is the degree of the minimal polynomial of a,, over E, which is at most the degree of 
the minimal polynomial of a, over F', by Problem 9. An application of the induction 
hypothesis completes the proof. 


Section 3.2 


. f(X) = (X — 2)”, so we may take the splitting field K to be Q itself. 
. f(X) = (X — 1)? +3, with roots 1+ iV3, so K = Q(iV3). Now iV3 ¢ Q since 


(i/3)? = —3 < 0, so [K : Q| > 2. But iV3 is a root of X? +3, so [K : Q < 2. 
Therefore [K : Q] = 2. 


. Let a be the positive 4” root of 2. The roots of f(X) are a,ia,—a and —ia. Thus 


K = Q(a,i). Now f(X) is irreducible by Eisenstein, so [Q(a@) : Q| = 4. Since 
i ¢ Q(a) and i is a root of X?7 +1 € Q(a)[X], we have [K : Q(a)] = 2. By (3.1.9), 
[K:Q])=2x4=8. 


. The argument of (3.2.1) may be reproduced, with the polynomial f replaced by the 


family C of polynomials, and the roots aj,...,az of f by the collection of all roots of 
the polynomials in the family C. 


. Take f = fi--: f,. Since a is a root of f iff a is a root of some f;, the result follows. 
. If the degree is less than 4, it must be 2 (since \/m ¢ Q). In this case, /n = a+ bm, 


son = a? + b?’m+ 2ab\/m. Since m is square-free, we must have a = 0 or b = 0, 
and the latter is impossible because n is square-free. Thus \/n = b\/m, so n = b?m, a 
contradiction of the hypothesis that m and n are distinct and square-free. 
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Section 3.3 


1. 


If ay,...,@n form a basis for EF over F’, then E is generated over F' by the a;. Each 
a; is algebraic over F' because F'(a;) C E, and (3.1.10) applies. 


. There are only countably many polynomials with rational coefficients, and each such 


polynomial has only finitely many roots. Since an algebraic number must be a root of 
one of these polynomials, the set of algebraic numbers is countable. Since the complex 
field is uncountably infinite, there are uncountably many transcendental numbers. 


. The complex field C is algebraically closed, and C is an extension of the rational field Q. 


But C is not algebraic over Q, by Problem 2. 


. The algebraic numbers A form a field by (3.3.4), and A is algebraic over Q by definition. 


But it follows from Section 2.9, Problem 2, that A contains subfields of arbitrarily high 
degree (in fact subfields of every degree) over Q, so that A/Q is not finite. 


. This can be verified by transfinite induction. A splitting field is always an algebraic ex- 


tension (see (3.2.2)), and the field Fc is algebraic over F' by the induction hypothesis. 
The result follows from (3.3.5). 


. By definition of algebraic number, A is an algebraic extension of Q. If @ is algebraic 


over A, then as in (3.3.5), a is algebraic over Q, so a € A. Thus A has no proper 
algebraic extensions, so by (3.3.1), A is algebraically closed. 


. Since £ is an extension of F' we have |F| < |E|. Suppose that a € E and the minimal 


polynomial f of a has roots aj,...,Qn, with a = a;. Then the map a — (f,7) is 
injective, since f and 7 determine a. It follows that |E| < |F[X]|No = |F[X]|. But 
for each n, the set of polynomials of degree n over F has cardinality |F|"*! = |F|, so 
F[X]| = [FINo = |F|. Thus |E| = [F. 


. Let C be an algebraic closure of F’, and let A be the set of roots in C of all polynomials 


in S. Then F(A), the field generated over F' by the elements of A, is a splitting field 
for S over F; see Section 3.2, Problem 4. 


. If F is a finite field with elements a1,...,@, the polynomial f(X) = 1+]]j_,(X —ai) 


has no root in F’, so F cannot be algebraically closed. 


Section 3.4 


. Let f(X) = (X — 1)” over F,. 
. ais a root of X? — a? =(X — a)’, so m(X) divides (X — a)?. 
. By Problem 2, m(X) = (X — a)" for some r. We are assuming that a is separable 


over F'(a?), so m(X) must be simply X — a. But then a € F(a?). 


. The “if’ part follows from the proof of (3.4.5), so assume that F' is perfect and let 


be F. Let f(X) = X” — 6 and adjoin a root a of f. Then a? = b, so F(a?) = 
F(b) = F. By hypothesis, a is separable over F = F(a?), so by Problem 3, a € F. 
But then b is the p'”power of an element of F. 


. If ay,...,Q@, is a basis for FE over F, then by the binomial expansion mod p, 


K = F(al,...,a2). Now since E/F is algebraic, the elements of F(a?) can be ex- 
pressed as polynomials in af with coefficients in F. Continuing, a is algebraic over 
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10. 


11. 


F, hence over F(a‘), so each element of F(ai,a$) can be written as a polynomial in 
a with coefficients in F(a). Such an element has the form 


y ( S- brs") ab® 


with the b,, € F. An induction argument completes the proof. 


. Extend the y; to a basis y1,...,Yn for E over F. By Problem 5, every element of 


E(= F(E?)) has the form y = ayy) +---+ any? with the a; € F. Thus {y!,...,y?} 
spans F over F’. It follows that this set contains a basis, hence (since there are exactly 
n vectors in the set) the set is a basis for EF over F’. The result follows. 


. Assume the extension is separable, and let a € E. Then a is separable over F', hence 


over F'(a?), so by Problem 3, a € F(E?). Thus £ = FE”). Conversely, suppose that 
E = F(E?) and the element a € £ has an inseparable minimal polynomial m(X). 
By (3.4.3), m(X) is of the form bp +b X? +-+-+b,_1X"-YP 4+. X™?, Since m(a) = 0, 
the elements 1,a?,...,a”? are linearly dependent over F. But by minimality of 
m(X), 1,a,...,a@”?~! are linearly independent over F’, hence 1,a,...,a” are linearly 
independent over F’. (Note that rp —1 > 2r—1> pr.) By Problem 6, 1,a?,...,a7? 
are linearly independent over F’, which is a contradiction. Thus E’/F is separable. 


. We may assume that F' has prime characteristic p. By Problem 7, EF = K(E?) and 


K = F(K?). Thus E = F(K?, E?) = F(E?) since K < E. Again by Problem 7, 
E/F is separable. 


. If g can be factored, so can f , and therefore g is irreducible. If f(X) = g(X?”) with 


m maximal, then g ¢ F[X”]. By (3.4.3) part (2), g is separable. 

Suppose that the roots of g in a splitting field are cy,...,c,. Then f(X) = g(X?”") = 
(XP — c)---(X?" — c,). By separability of g, the c; must be distinct, and since 
f(a) = 0, we have a?” = cj for all 7. This is impossible unless r = 1, in which case 
f(X) =X?" — cq. But f € F[X], so a?” =a € F. 

If a?” =c € F, then a is a root of X°" —c = (X — a)", so min(a, F) is a power 
of X — a, and therefore has only one distinct root a. The converse follows from 
Problem 10 with f = min(a, F). 


Section 3.5 


1. 


Take F = Q, K = Q(2) (see (3.5.3)), and let E be any extension of K that is normal 
over F’, for example, E = C. 


. The polynomial f(X) = X? — a is irreducible, else it would factor as (X — b)(X — c) 


with b+ ¢=0, bc =a, ie., (X — b)(X +b) with b? = a, contradicting the hypothesis. 
Thus FE is obtained from Q by adjoining a root of f. The other root of f is —/a, so 
that FE is a splitting field of f over Q. By (3.5.7), E/Q is normal. 


. Take F = Q, K = Q(v2), E = Q(¥2). Then K/F is normal by Problem 2, and 


E/K is normal by a similar argument. But E/F is not normal, since the two complex 
roots of X+—2 do not belong to E. The same argument works with 2 replaced by any 
positive integer that is not a perfect square. 


4, 


Ts 


8. 
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There are at most n embeddings of E in C' extending a. The proof is the same, except 
that now g has at most r distinct roots in C, so there are at most r possible choices 
of 3. The induction hypothesis yields at most n/r extensions from F(a) to E, and the 
result follows. 


. Since the rationals have characteristic zero, the extension is separable. Since E is the 


splitting field of (X? — 2)(X? — 3) over Q, the extension is normal, hence Galois. 


. Since V3 ¢ Q(V2), the extension has degree 4. By (3.5.9), there are exactly four 


Qautomorphisms in the Galois group. By (3.5.1), each such Q-automorphism must 
permute the roots of X?—2 and must also permute the roots of X?—3. There are only 
four possible ways this can be done. Since a Q-automorphism is completely specified 
by its action on 2 and V3, the Galois group may be described as follows: 


(1) V2 V2, V3 > V3; 

(4) Y25--V72, vV3—--v%3. 

Since the product (composition) of any two of automorphisms (2),(3),(4) is the third, 
the Galois group is isomorphic to the four group (Section 1.2, Problem 6). 

Yes, up to isomorphism. If f is the polynomial given in (3.5.11), any normal closure is 
a splitting field for f over F’, and the result follows from (3.2.5). 

If f is irreducible over F and has a root in EF, Eg, then f splits over both EF; and Fa, 
hence all roots of f lie in FE, M Ey. Thus f splits over £, 9 E2, and the result follows. 


Section 4.1 


de 


If a € R, take r(x +J) to be rx +I to produce a left R-module, and (4+ J)r = ar+I 
for a right R-module. Since I is an ideal, the scalar multiplication is well-defined, and 
the requirements for a module can be verified using the basic properties of quotient 
rings. 


. If Ais an algebra over F’, the map x — «x1 of F into A is a homomorphism, and since F’ 


is a field, it is a monomorphism (see (3.1.2)). Thus A contains a copy of F’. Conversely, 
if F is a subring of A, then A is a vector space over F’, and the compatibility conditions 
are automatic since A is commutative. 


. Let R= Zand let M be the additive group of integers mod m, where m is composite, 


say m = ab with a,b > 1. Take = a (mod m) and r = b. 


. Any set containing 0 is linearly dependent, so assume a/b and c/d are nonzero rationals. 


Since Ls is rational, the result follows. 


. In view of Problem 4, the only hope is that a single nonzero rational number a/b spans 


M over Z. But this cannot happen, since an integer multiple of a/b must be a fraction 
whose denominator is a divisor of b. 


.IfaeACCandae€ BNC, then az € (AB) NC. Conversely, let c= ab € (AB) NC. 


Then b=a~!c€ C since AC C. Thus ab € A(BNC). 
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7. If f(X) = a9 +a,X +---+anX” and ve V, take 
f(X)u = f(T)v = aglu + ayTu +--+ +anT”v 


where I is the identity transformation and J” is the composition of T with itself i 
times. 


Section 4.2 


1. Let W be a submodule of M/N. By the correspondence theorem, W = L/N for some 
submodule L of M with L > N. Since L = L+ N, we have W = (L+N)/N. 

2. No. If S is any submodule of M, then S+ N is a submodule of M containing N, 
so $+ N corresponds to W = (S + N)/N. We know that W can also be written as 
(L+N)/N where L > N. (For example, L = S+N.) By the correspondence theorem, 
S+N=L+N, and there is no contradiction. 

3. If Ae M,,(R), then AF, retains column 1 of A, with all other columns zero. 


4. To identify the annihilator of £41, observe that by Problem 4, AF; = 0 iff column 1 
of A is zero. For the annihilator of M, note that Ej, € M for every j, and AE; has 
column j of A as column 1, with zeros elsewhere. (See Section 2.2, Problem 4.) Thus 
if A annihilates everything in M, then column j of A is zero for every j, so that A is 
the zero matrix. 

5. R/I is an R-module by Problem 1 of Section 4.1 If r € R then r+ J =r(14+ J), so 
R/T is cyclic with generator 1 + I. 

6. We must show that scalar multiplication is well-defined, that is, if r € I, then rm = 0 
for all m € M. Thus J must annihilate M, in other words, [M = 0, where the 
submodule IM is the set of all finite sums }° rj;m,,rj; © R,m; € M. 


7. No, since (r + I)m coincides with rm. 


Section 4.3 


1. Essentially the same proof as in (4.3.3) works. If z] +---+ 2, = 0, with z; € Mj, then 
Zn is a sum of terms from previous modules, and is therefore 0. Inductively, every z; 
is 0. (In the terminology of (4.3.3), 2; is xj — y;-) 

2. Only when A = {0}. If A has n elements, then by Lagrange’s theorem, nx = 0 for 
every x € A, so there are no linearly independent sets (except the empty set). 

3. This follows because (—s)r + rs =0. 

4. If I is not a principal ideal, then J can never be free. For if J has a basis consisting of 
a single element, then I is principal, a contradiction. But by Problem 3, there cannot 
be a basis with more than one element. If J = (a) is principal, then J is free if and 
only if a is not a zero-divisor. 

5. Z, or any direct sum of copies of Z, is a free Z-module. The additive group of rational 
numbers is not a free Z-module, by Problem 5 of Section 4.1 
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. The “only if” part was done in (4.3.6), so assume that M has the given property. Con- 


struct a free module M’ = @;¢s5R; where R; = R for all 7. Then the map f: S — M’ 
with f(i) = e; (where e; has 1 in its i‘” component and zeros elsewhere) extends to a 
homomorphism (also called f) from M to M’. Let g: M’ — M be the module homo- 
morphism determined by g(e;) = i. Then go f is the identity on S, hence on M, by 
the uniqueness assumption. Similarly, fog =1. 


. An element of M is specified by choosing a finite subset F' of a, and then selecting an 


element b; € R for each i € F’. The first choice can be made in a ways, and the second 
in |R|!¥! = |R| ways. Thus |M| = a|R| = max(a,|R)). 


. We may take B to the set of “vectors” (e;) with 1 in position 7 and zeros elsewhere. 


Thus there is a basis element for each copy of R, so |B| = a. 


Section 4.4 


Ii, 


ND Or 


To prove that the condition is necessary, take the determinant of the equation 
PP-'=I. Sufficiency follows from Cramer’s rule. 


. A homomorphism f: V — W is determined by its action on elements of the form 


(0,...,0,2;,0,...,0). Thus we must examine homomorphisms from V; to 67,W;. 
Because of the direct sum, such mappings are assembled from homomorphisms from 
V; toW;,i=1,...,m. Thus f may be identified with an mxn matrix whose ij element 


is a homomorphism from V; to W;. Formally, we have an abelian group isomorphism 


Homa(V, W) = [Homr(V;, W;)). 


. In Problem 2, replace V and W by V” and take all W; and V; to be V. This gives an 


abelian group isomorphism of the desired form. Now if f corresponds to [f;;] where 
fiz: Vj — Vi, and g corresponds to [g;;], then the composition go f is assembled 
from homomorphisms giz ° fx;: Vj - Ve — Vi. Thus composition of homomorphisms 
corresponds to multiplication of matrices, and we have a ring isomorphism. 


. In (4.4.1), take n =m=1 and M = R. 

. Since f(x) = f(a1) = xf (1), we may take r = f(1). 

. This follows from Problems 3 and 4, with V = R. 

. If the endomorphism f is represented by the matrix A and g by B, then for any c € R, 


we have c(go f) = (cg) °o f = go (cf), so Endr(M) is an R-algebra. Furthermore, 
cf is represented by cA, so the ring isomorphism is also an R-module homomorphism, 
hence an R-algebra isomorphism. 


Section 4.5 


it, 


Add column 2 to column 1, then add -3 times column 1 to column 2, then add —4 
times row 2 to row 3. The Smith normal form is 


1 0 0 


S=|0 3 0 
0 0 6 
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The matrix P~! is the product of the elementary column matrices in the order in which 
they appeared. Thus 


1 0 0] f1 -3 0 Ly sO 
P-'=]1 1 O] Jo 1 oO] =]1 -2 0 
G08 a Gs. ge A 0 0 1 
30 
P=|-1 1 0 
0 01 


The matrix Q is the product of the elementary row matrices in opposite order (i.e., if 
R, appears first, followed by Ry and R3, then Q = R3R2Rj). In this case there is only 
one matrix, so 


A direct computation shows that QAP! = S. 


. The new basis is given by Y = PX, ie., yy = —221 + 322, yo = —2@1 + 2, y3 = V3. 


The new set of generators is given by V = SY, ie., vj = y1, Vo = 3yo, v3 = bys. 


. Let d; = a,---a;. Then d; is the gcd of the i x i minors of S, and hence of A. The 


a; are recoverable from the d; via a, = d, and a; = d;/d;_1,i > 1. Thus the a; are 
determined by the matrix A and do not depend on any particular sequence leading to 
a Smith normal form. 


. If Aand B have the same Smith normal form S$, then A and B are each equivalent to 


S and therefore equivalent to each other. If A and B are equivalent, then by the result 
stated before Problem 4, they have the same gcd of ¢ xi minors for all 2. By Problem 4, 
they have the same invariant factors and hence the same Smith normal form. 


. Here are the results, in sequence: 


1. The second row is now (3 2 —13 2) 


2. The first row is (3 2 —13 2) and the second row is (6 4 13 5) 
3. The second row is (0 0 39 1) and the third row is (0 0 51 4) 
4, The third row becomes (0 0 12 3) 
5. The second row is (0 0 12 3) and the third row is (0 0 39 1) 
6. The third row is (0 0 3 —8) 
7. The second row is (0 0 3 —8) and the third row is (0 0 12 3) 
8. The third row is (0 0 0 35) 
9. The first row is now (3 2 2 —38) 
10. The final matrix is 
3.2 2 32 
0 0 3 27 
0 0 0 35 
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7. We see from the Hermite normal form that we can take x = 0,y = 7, z = 9, provided 
0 and 35 are congruent mod m. Thus m must be 5, 7 or 35. 


Section 4.6 


1. 441 = 3? x 7?, and since there are two partitions of 2, there are 2 x 2 = 4 mutually 
nonisomorphic abelian groups of order 441, with the following invariant factors: 


(1) a, = 3°72, O& Zan 

(2) ay = 3°71, ag = 377', GY Z7 OG Zo 
(3) ay = 317°, ag = 3'7?, G& Z3 @ Zya7 
(4) a, = 317}, ag = 3'7!, G2 Zn O Zo) 


2. 40 = 2° x 5!, and since there are three partitions of 3 and one partition of 1, there 
are 3 x 1 = 3 mutually nonisomorphic abelian groups of order 40, with the following 
invariant factors: 


(1) ay = 2351, G & Zo 
(2) ay = 215°, ag = 2751, Ge Z9 @ Z20 
(3) ay = 215°, ag = 2159, a3 = 2'51, G 2 Zo @ Ze ® Zy9 


3. The steps in the computation of the Smith normal form are 


1 5 3 1 5 3 1 O 0 1 0 0 
pent a O -ll 1]/—j0 -11 1);,->)0 1 -tI1 
3.4 2 0 -l1l —-7 0 -ll —7 0 -—7 —-Ill 
1 0 0O 1 0 0 
—|0 1 -llj —-|0 1 O 
0 0 —88 0 0 88 


Thus G = Z; © Z; © Zgg = Zgg. 

4. Cancelling a factor of 2 is not appropriate. After the relations are imposed, the group 
is no longer free, so that 2y = 0 does not imply that y = 0. Another difficulty is 
that the submodule generated by 227; + 2x2 + 823 is not the same as the submodule 
generated by 71 + 42 + 423. 

5. Take M = 69°, My, where each M, is a copy of Z. Take N = Z and P = 0. Since the 
union of a countably infinite set and a finite set is still countably infinite, we have the 
desired result. 

6. If N and P are not isomorphic, then the decompositions of N and P will involve 
different sequences of invariant factors. But then the same will be true for M @N and 
M@®P,so M@WN and M © P cannot be isomorphic. 


Section 4.7 


1. If w’ is another solution, then f’u = f’u’(= vf), and since f’ is injective, u = wu’. 
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By commutativity, wg fa = g/vfa, and by exactness, gf = 0. Thus ufa € kerg’ = 
im f’ by exactness. 

For commutativity, we must have f’ua = vfa = f’a’, so ua = a’. Note that a’ is 
unique because f’ is injective. Checking that u is a homomorphism is routine, e.g., if 
ufa; = f'ai,i = 1,2, then vf (a; + a2) = f’(a, +4), so u(ay + a2) = ua, + uazg, etc. 


4. uc= ugb = g'vb. 


13. 


Suppose c = gb; = gbg. Then b, — by € kerg = imf by exactness, so b; — bg = fa. 
Then f’wa = vfa = v(b1 — b2). By exactness, 0 = g' f’wa = g’v(bi — bz), and the 
result follows. 

Add a vertical identity map at the left side of the diagram and apply (ii) of the four 
lemma. 

Add a vertical identity map at the right side of the diagram and apply (i) of the four 
lemma. 

Add a vertical identity map w at the right side of the diagram and apply (ii) of the 
four lemma, shifting the notation [s > t,t > u, uv, v > wu). 


Since u and g’ are surjective, v must be also, by commutativity. 
g J ry y: 


. Since f and w are injective, f’t, hence ¢, must be also, by commutativity. 


. Add a vertical identity map s at the left side of the diagram, and apply (i) of the four 


lemma, shifting notation [t > s, ut, vu, w— UI. 


. If vb = 0, then b = gm, hence 0 = vgm = g’um. Thus um € kerg’ = im f’, say 


um = f'a’. Since t is surjective, a’ = ta, so ufa = f’ta = f’a’. Therefore wm and 
ufa are both equal to f’a’. Since u is injective, m = fa, sob = gm = gfa = 0, 
proving that v is injective. 


Let a’ € A’. Since uw is surjective, f’a’ = um, so vgm = g'um = g’ f'a’ = 0. Since v 
is injective, gm = 0, hence m € kerg = imf, som = fa. Thus um = ufa = f'ta. 
Therefore f’a’ and f’ta are both equal to um. Since f’ is injective, a’ = ta, proving 
that t is surjective. 


Section 5.1 


1. 


The kernel of any homomorphism is a (normal) subgroup. If g € ker® then 
g(zH) = <H for every x € G, so by (1.3.1), e~!gx € H. Take x = g to get g € H. 


. By Problem 1, ker ® is a normal subgroup of G, necessarily proper since it is contained 


in H. Since G is simple, ker ® = {1}, and hence © is injective. Since there are n left 
cosets of H, ® maps into Si. 


_ If |G: H] =n < o, then by Problem 2, G can be embedded in S,,, so G is finite, a 


contradiction. 


4. o(a@H) = cH iff c—1g2 € H iff g €xHax!. 
5. Ife €G, then K = 2Ka~! CxH2x™!, and since z is arbitrary, K C N. 
6. gm (HOK) = (HOR) iff goign € HONK iff gH = goH and gi K = goK, proving 


both assertions. 
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. Since [G : H] and [G: K] are relatively prime and divide [G : HM K] by (1.3.5), their 


product divides, and hence cannot exceed, [G: HM Ky]. 


. By the first isomorphism theorem, G/N is isomorphic to a group of permutations of L, 


the set of left cosets of H. But |L| = [G: H] =n, so by Lagrange’s theorem, |G/N| 
divides |S,,| = nl. 


. Since n > 1, H is a proper subgroup of G, and since N is a subgroup of H, N isa 


proper subgroup of G as well. If N = {1}, then |G| = [G : N], so by Problem 8, 
G divides n!, contradicting the hypothesis. Thus {1} < N < G, and G is not simple. 


Section 5.2 


Li, 


For arbitrary o and 7, we have tam~!(z(i)) = 70(i). In the cycle decomposition of a, 
i is followed by o(i), and in the cycle decomposition of tam~', r(i) is followed by 
mo(i), exactly as in the given numerical example. 


_ If g € Cc(S) and x € S then grg~! = x, so gSg~! = S, hence Cg(S) < Ne(S). 


If g € Na(S) and x € S, then grg~! € S, and the action is legal. As in (5.1.3), 
Example 3, the kernel of the action consists of all elements of Ne(S) that commute 
with everything in S, that is, Ng(S)N Co(S) = Co(S). 


3. We have z € G(gz) iff zgx = gx iff g-1zgx = « iff g-!zg € G(a) iff z € gG(x)g™. 


4, We have giG(x) = g2G(x) iff gy'g1 € G(a) iff gy gia = & iff gia = goa, proving that 


. If g; is an element in the 7 


W is well-defined and injective. If the action is transitive and y € X, then for some zg, 
y = gx = V(gG(az)) and W is surjective. 


. If g,h € G, then h takes gx to hgz. In the coset action, the corresponding statement is 


that h takes gG(x) to hgG(x). The formal statement is that V is a“G-set isomorphism”. 
In other words, W is a bijection of the space of left cosets of G(x) and X, with U(hy) = 
hW(y) for all h € G and y in the coset space. Equivalently, the following diagram is 
commutative. 


gx ——__> hgx 


gG(x) ——> hgG(z) 


. The two conjugacy classes are {1} and G \ {1}. Thus if |G| = n > 1, the orbit sizes 


under conjugacy on elements are 1 and n — 1. But each orbit size divides the order of 
the group, so n — 1 divides n. Therefore n = k(n — 1), where k is a positive integer. 
Since k = 1 is not possible, we must have k > 2, so n > 2(n—1), son < 2. 


th conjugacy class, 1 < i < k, then by the orbit-stabilizer 


theorem, the size of this class is |G|/|Co(g;)|. Since the orbits partition G, the sum of 
the class sizes is |G], and 


1 
et 


i=1 “* 
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where x; = |Ce(gi)|. If, say, g1 = 1, so that x; = |G|, the result follows from the 
observation that each x;, in particular 71, is bounded by N(k). 


Section 5.3 


i, 


The group elements are J, R = (1,2,3,4), R? = ( 
F = (1)(3)(2,4), RF = (1,2)(3,4), R°F = (1,3)(2)(4 
the number of distinct colorings is 


(ni+ntn+ntnr+n? +n? +n?) = (n* + 2n? + 3n? + 2n). 


. Yes. If the vertices of the square are 1, 2,3, 4 in counterclockwise order, we can identify 


vertex 1 with side 12, vertex 2 with side 23, vertex 3 with side 34, and vertex 4 with 
side 41. This gives a one-to-one correspondence between colorings in one problem and 
colorings in the other. 


. If the vertices of the square are 1,2,3,4 in counterclockwise order, then WGGW 


will mean that vertices 1 and 4 are colored white, and vertices 2 and 3 green. The 
equivalence classes are 


{WWWW}, {GGGG}, {WGGG, GWGG, GGWG, GGGW}, 
{GWWW, WGWW, WWGW, Wwwoe}, 
{WWGG, GWWG, GGWW, WGGW}, {WGWG, GwWew}. 


. Label (—1,0) as vertex 1, (0,0) as vertex 2, and (1,0) as vertex 3. Then I = (1)(2)(3) 


and o = (1,3)(2). Thus the number of distinct colorings is $(n? + n?). 


. We have free choice of color in two cycles of J and one cycle of o. The number of 


distinct colorings is $(n? +n). 


. We can generate a rotation by choosing a face of the tetrahedron to be placed on a 


table or other flat surface, and then choosing a rotation of 0,120 or 240 degrees. Thus 
there are 12 rotations, and we have enumerated all of them. By examining what each 
rotation does to the vertices, we can verify that all permutations are even. Since Ay 
has 4!/2 = 12 members, G must coincide with Ay, up to isomorphism. 


. The members of Ag are (1,2,3), (1,3,2), (1,2,4), (1,4,2), (1,3,4), (1,4,3), (2,3,4), 


(2, 4,3), (1,2)(3, 4), (1, 3)(2, 4), (1, 4)(2, 3), and the identity. Counting cycles of length 
1, we have 11 permutations with 2 cycles and one permutation with 4 cycles. The 


number of distinct colorings is 4(n* + 11n?). 


. In the above list of permutations, the first 8 have no fixed colorings. In the next 3, we 


can pick a cycle to be colored B, and pick a different color for the other cycle. This 
gives 2 x 3 = 6 fixed colorings. For the identity, we can pick two vertices to be colored 
B, and then choose a different color for each of the other two vertices. The number 
of fixed colorings is ($)3? = 54. The number of distinct colorings of the vertices is 
[((6x3) + 54)]/12 = 6. 


9. 


10. 


11. 


12. 
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As in Problem 6, a rotation can be generated by choosing a face of the cube to be 
placed on a table, and then choosing a rotation of 0,490 or 180 degrees. Thus there 
are 24 rotations, and we have enumerated all of them. Alternatively, there is a one- 
to-one correspondence between rotations and permutations of the 4 diagonals of the 
cube. Since there are 4! = 24 permutations of a set with 4 elements, there can be no 
additional rotations. The correspondence between rotations and permutations of the 
diagonals yields an isomorphism of G and S4. 


Any permutation of the faces except the identity has a cycle of length 2 or more, and 
each of the faces within that cycle must receive the same color, which is a contradic- 
tion. Thus f(a) = 0 for 7 4 I. Now I fixes all legal colorings, and since there are 6 
colors and 6 faces, the number of legal colorings is 6! = 720. The number of distinct 
colorings is therefore 720/24 = 30. 

Remark This problem can be solved directly without using the heavy machinery 
of this section. Without loss of generality, choose any particular color for a particular 
face, and move the cube so that this face is at the bottom. Choose one of the remaining 
5 colors for the top face. The number of allowable colorings of the 4 remaining sides 
of the cube is the number of circular permutations of 4 objects, which is 3! = 6. The 
number of distinct colorings is 5 x 6 = 30. 

The group G = {1, R, R?,...,R?~*} is cyclic of order p. Since p is prime, each R’, 
i=1,...,p—1, has order p, and therefore as a permutation of the vertices consists 
of a single cycle. Thus the number of distinct colorings is 


= [n+ (p —1)n]. 


Since the result of Problem 11 is an integer, n? + (p—1)n = n? —n+np is a multiple 
of p, hence so is n? — n. Thus for any positive integer n, n? = n mod p. It follows 
that if n is not a multiple of p, then n?~! = 1 mod p. 


Section 5.4 


de, 


Let G act on subgroups by conjugation. If P is a Sylow p-subgroup, then the stabilizer 
of P is Ng(P) (see (5.2.2), Example 4). By (5.2.3), the index of Ng(P) is np. 


. Since P is normal in Neg(P) (see (5.2.2), Example 4), PQ = QP < G by (1.4.3). By 


(5.2.4), PQ is a p-subgroup. 


. The Sylow p-subgroup P is contained in PQ, which is a p-subgroup by Problem 2. 


Since a Sylow p-subgroup is a p-subgroup of maximum possible size, we have P = PQ, 
and therefore Q C P. 


. (a) By definition of normalizer, we have gPg~! < gN@(P)g"! < gHg-| = H. Thus 


P and gPg™' are subgroups of H, and since they are p-subgroups of maximum 
possible size, they are Sylow p-subgroups of H. 

(b) Since H is always a subgroup of its normalizer, let g € Nc(H). By (a), P and 
gPg7' are conjugate in H, so for some h € H we have gPg~! = hPh—!. Thus 
(h-'g)P(h-*g)~1 = P, so h-+g € Nc(P) < H. But then g € H, and the result 
follows. 
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5. By (5.2.4), [NV : PON] = [PN : P] = |PN|/|P|. Since |P| is the largest possible 
power of p for p-subgroups of G, [PN : P] and p must be relatively prime. Therefore 
[N : PN] and p are relatively prime, so PM N is a p-subgroup of N of maximum 
possible size, i.e., a Sylow p-subgroup of N. 

6. By the third isomorphism theorem, [G/N : PN/N] = [G: PN] = |G|/|PN|. Since 
|G|/|P| and p are relatively prime and P < PN, it follows that |G|/|PN| and p are 
relatively prime. The result follows as in Problem 5. 


7. Since f is an automorphism, f(P) is a subgroup of G and has the same number of 
elements as P, in other words, f(P) is a Sylow p-subgroup. By hypothesis, f(P) = P. 

8. By (1.3.5), [G4.: N] = [G: H|[H : N] = p|H : MN], and since [G : N] divides p! = 
p(p — 1)!, the result follows. 

9. If g is a prime factor of [H : N], then by Problem 8, gq is a divisor of some integer 
between 2 and p — 1, in particular, gq < p—1. But by Lagrange’s theorem, gq divides 
|H|, hence q divides |G|. This contradicts the fact that p is the smallest prime divisor 
of |G|. We conclude that there are no prime factors of [H : N], which means that 
[H : N|] =1, Thus H = N, proving that H is normal in G. 


Section 5.5 


1. This follows from (5.5.6), part (iii), with p= 3 and q=5. 

2. Let Z(G)a be a generator of G/Z(G). If gi, g2 € G, then Z7(G)g, = Z(G)a’' for some i, 
so gia~* = 2 € Z(G), and similarly goa~I = z € Z(G). Thus gigo = a'zja)z2 = 
z,z9a'tI = zozjait* = zea) za" = 9291. 

3. By (5.5.3), the center Z(G) is nontrivial, so has order p or p?. In the latter case, 
G = Z(G), so Gis abelian. If |Z(G)| = p, then |G/Z(G)| = p, and G/Z(G) has order p 
and is therefore cyclic. By Problem 2, G is abelian (and |Z(G)| must be p?, not p). 

4. Each Sylow p-subgroup is of order p and therefore has p — 1 elements of order p, with 
a similar statement for g and r. If we include the identity, we have 1+ n,(p—1)+ 
ng(q —1)+n,(r — 1) distinct elements of G, and the result follows. 


5. G cannot be abelian, for if so it would be cyclic of prime order. By (5.5.5), np, nq 
and n; are greater than 1. We know that n, divides gr and np, > 1. But np can’t be 
q since q # 1 mod p (because p > q). Similarly, n, can’t be r, so ny = gr. Now ng 
divides pr and is greater than 1, so as above, ng must be either p or pr (it can’t be r 
because q > r, sor #1 mod q). Thus n, > p. Finally, n, divides pq and is greater 
than 1, so n, is p, g, or pg. Since p > q, we have n, > q. 


6. Assume that G is simple. Substituting the inequalities of Problem 5 into the identity 
of Problem 4, we have 


par =>1+ qr(p—1)+p(q—-1)+q(r-1). 
Thus 


0>pq—p-—qt+1=(p-1)(q-1), 


a contradiction. 
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7. Since |P| = p" with r > 1 and m > 1, we have 1 < |P| < |G|. Since G is simple, P is 
not normal in G. By (5.5.4), n > 1. By Problem 9 of Section 5.1, |G| divides n!. 


8. Assume G simple, and let n = np with p = 5. By Sylow (2), n divides 24 = 16 and 
n =1 mod 5. The only divisors of 16 that are congruent to 1 mod 5 are 1 and 16, and 
1 is excluded by Problem 7. Thus the only possibility is n = 16, and by Problem 7, 
245° divides 16!, hence 5° divides 16!. But in the prime factorization of 16!, 5 appears 
with exponent 3 (not 6), due to the contribution of 5,10 and 15. We have reached a 
contradiction, so G cannot be simple. 


Section 5.6 


1. Apply the Jordan-Hélder theorem to the series 1d NG. 


2. Z has no composition series. By Section 1.1, Problem 6, each nontrivial subgroup of 
Z consists of multiples of some positive integer, so the subgroup is isomorphic to Z 
itself. Thus Z has no simple subgroups, so if we begin with {0} and attempt to build 
a composition series, we cannot even get started. 


3. We have the composition series 1 < Z2 <Z2 @ Z3 = Ze (or 14 Z3 4 Zo G6 Z3 = Ze) and 
14 A3 < S3. 


4. A, consists of products of an even number of transpositions, and the result follows 
from the observation that (a,c)(a,b) = (a,b,c) and (c, d)(a, 6) = (a, d,c)(a, b,c). 


5. If (a,b,c) € N and (d,e, f) is any 3-cycle, then for some permutation 7 we have 
n(a,b,c)m~! = (d,e, f). Explicitly, we can take m(a) = d, 1(b) = e, m(c) = f; see 
Section 5.2, Problem 1. We can assume without loss of generality that a is even, for 
if it is odd, we can replace it by (g,h)a, where g and h are not in {d,e, f}. (We use 
n > 5 here.) Since N is normal, (d,e, f) € N. 


6. If N contains (1,2,3,4), then it contains (1,2,3)(1,2,3,4)(1,3,2) = (1,4,2,3), and 
hence contains (1,4, 2,3)(1,4,3,2) = (1,2,4), contradicting Problem 5. If N con- 
tains (1,2,3,4,5), then it contains (1, 2,3)(1,2,3,4,5)(1,3,2) = (1,4,5,2,3), and so 
contains (1,4,5,2,3)(1,5,4,3,2) = (1,2,4), a contradiction. The analysis for longer 
cycles is similar. [Actually, we should have assumed that N contains a permuta- 
tion z whose disjoint cycle decomposition is ---(1,2,3,4)---. But multiplication by 
mt =---(1,4,3,2)--- cancels the other cycles.] 

7. If N contains (1, 2,3)(4, 5,6), then it must also contain (3, 4, 5)(1, 2, 3)(4,5,6)(3,5,4) = 
(1, 2,4)(3,6,5). Thus N also contains (1, 2,4)(3,6,5)(1,2,3)(4,5,6) = (1,4,3, 2,6), 
which contradicts Problem 6. If the decomposition of a permutation o in N contains 
a single 3-cycle, then o? is a 3-cycle in N, because a transposition is its own inverse. 
This contradicts Problem 5. 

8. If, (1,2)(3,4) € N, then (1,5,2)(1, 
does (1, 5)(3, 4)(1, 2)(3,4) = 2,5 

9. If N contains (1, 2)(3, 4)(5,6)(7, 8), then it contains 


2)(3,4)(1, 2,5) = (1, 5)(3,4) belongs to N, and so 
), contradicting Problem 5. 


(2, 3) (4, 5)(1, 2) (3, 4) (5, 6) (7, 8)(2, 3)(4,5) = (1, 3) (2, 5)(4, 6)(7, 8). 
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10. 


11. 


Therefore N contains 
(1, 3)(2, 5)(4, 6)(7, 8) (1, 2)(3, 4) (5, 6)(7, 8) = (1,5, 4)(2, 3, 6), 


contradicting Problem 7. 


We can reproduce the analysis leading to the Jordan-H6lder theorem, with appropriate 
notational changes. For example, we replace the “subnormal” condition G; < Gj41 
by the “normal” condition G; < G. 


We say that N is a minimal normal subgroup of H if {1} < N <i A and there 
is no normal subgroup of H strictly between {1} and N. In a chief series, there 
can be no normal subgroup of G strictly between G; and Gj41. Equivalently, by 
the correspondence theorem, there is no normal subgroup of G/G;; strictly between 
G;/G; = {1} and Gj41/G;. Thus Gj+41/G; is a minimal normal subgroup of G/G;. 


Section 5.7 


10. 


. $3 is nonabelian and solvable (1 < A3 <4 S3). 


. Let 1 = Go dG, <---<aG, = G be a composition series, with all G/G;_1 cyclic of 


prime order (see (5.7.5)). Since |G;| = |G;/Gi_1||Gi_1| and Gp is finite, an induction 
argument shows that G is finite. 


. The factors of a composition series are simple p-groups P, which must be cyclic of 


prime order. For 1<Z(P) < P, so Z(P) = P and P is abelian, hence cyclic of prime 
order by (5.5.1). [The trivial group is solvable with a derived series of length 0.] 


. $3 is solvable by Problem 1, but is not nilpotent. Since S3 is nonabelian, a central 


series must be of the form 1<H<S3 with H C Z(S3) = 1, a contradiction. 


. If S, is solvable, then so is A, by (5.7.4), and this contradicts (5.7.2). 
. By (5.5.3), P has a nontrivial center. Since Z(P) is normal in P and P is simple, 


Z(P) = P and P is abelian. By (5.5.1), P is cyclic of prime order, and since P is a 
p-group, the only possibility is |P| = p. 


. Let N be a maximal proper normal subgroup of P. (N exists because P is finite 


and nontrivial, and 1< P.). Then the p-group P/N is simple (by the correspondence 
theorem). By Problem 6, |P/N| = p. 


. If P is a Sylow p-subgroup of G, then |P| = p" and by Problem 7, P has a subgroup 


Q, of index p, hence of order p’~!. If Q, is nontrivial, the same argument shows that 
Q, has a subgroup Qz of order p’~?. An induction argument completes the proof. 


. Let G = Dg, the group of symmetries of the equilateral triangle. Take N = {I, R, R?}, 


where R is rotation by 120 degrees. Then N has index 2 in G and is therefore normal. 
(See Section 1.3, Problem 6, or Section 5.4, Problem 9.) Also, N has order 3 and 
G/N has order 2, so both N and G/N are cyclic, hence abelian. But G is not abelian, 
since rotations and reflections do not commute. 


It follows from the splicing technique given in the proof of (5.7.4) that dl(G) < 
dl(N) + dl(G/N). 
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Section 5.8 


1. 


10. 


11. 


Let H = (a | a”), and let C,, be a cyclic group of order n generated by a. Then 
a” = 1 in Cy, and since a"*) = aJ, we have |H| < n = |C,|. The result follows as 
in (5.8.6). 


. The discussion in Example 4 of (2.1.3), with i = a and j = b, shows that the quater- 


nion group Q satisfies all the relations. Since ab = ba™!, it follows as in (1.2.4) that 
every element of the given presentation H is of the form b’a*,r,s € Z. Since b? = a?, 
we can restrict r to 0 or 1, and since a+ = 1, we can restrict s to 0, 1, 2 or 3. Thus 


|H| < 8, and the result follows as in (5.8.6). 


. Take a = (1,2,3) and b = (1,2) to show that $3 satisfies all the relations. Since 


ba = a~'b, each element of H is of the form a’b’,r,s € Z. Since a® = b? = 1, 
|H| <3 x 2 =6, and the result follows as in (5.8.6). 


. No. There are many counterexamples; an easy one is C, = (a| a” = 1,a?" = 1), the 


cyclic group of order n. 


. nz'ny =hehy' ee NOH =1. 
. Take ~ to be the inclusion map. Then 7(h) = 7(h) = 7(1h) = h. To show that 7 is 


a homomorphism, note that nyhyngh2 = ni(hingh;')hihe and hingh{' EN. 


. Ifg € Gthen g = (gr(g)~!)r(g) with r(g) € H and m(gr(g)~!) = r(g)t(g)~! = 1, so 


gm(g)~' € N. [Remember that since we are taking ~ to be inclusion, 7 is the identity 
on H.] Thus G= NH. If g €¢ NOH, then g € kerz and g € H, sog = z7(g) = 1, 
proving that HNN = 1. 


. If we define m(n,h) = (1,h), t(n,1) = (n, 1), and #(1,h) = (1,h), then the sequence 


of Problem 6 is exact and splits on the right. 


. We have (nyh1)(ngh2) = ni(hynzhy')hyh2, so we may take f(h) to be the inner 


automorphism of N given by conjugation by h € H. 


Consider the sequence 


1 CaS eG, 1 


where C3 consists of the identity 1 and the 3-cycles (1,2,3) and (1,3,2), and C2 con- 
sists of the identity and the 2-cycle (1,2). The map # is inclusion, and 7 takes each 
2-cycle to (1,2) and each 3-cycle to the identity. The identity map from C2 to Ss 
gives a right-splitting, but there is no left splitting. If g were a left-splitting map 
from $3 to C3, then g(1,2) = (1,2,3) is not possible because g(1) = g(1,2)g(1,2) = 
(1, 2,3)(1, 2,3) = (1,3,2), a contradiction. Similarly, g(1,2) = (1,3,2) is impossible, 
so g(1,2) = 1, so go7% cannot be the identity. Explicitly, g(2,3) = g((1,2)(1,2,3)) = 
g(1,2,3) = (1,2,3), and (1,3) = g((1,2)(1,3,2)) = 9(1,3,2) = (1,3,2). Conse 
quently, g(1,3,2) = g((1,3)(2,3)) = 1, a contradiction. 

In the exact sequence of Problem 6, take G = Z,2, N = Zp, H = G/N = Z,, i the 
inclusion map, and a the canonical epimorphism. If f is a right-splitting map, its 
image must be a subgroup with p elements (since f is injective), and there is only one 
such subgroup, namely Zp. But then mo f = 0, a contradiction. 
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12. 


13. 


14. 


If g € G, then gPg~' C gNg-! = N, so P and gPg™! are both Sylow p-subgroups 
of N. By Sylow (3), they are conjugate in N (the key point). Thus for some n € N we 
have P = n(gPg~')n~'. But then by definition of normalizer we have ng € Nc(P), 
hence g € NNc(P). 


The multiplication table of the group is completely determined by the relations a” = 1, 
b? = 1, and bab-! = a+. The relations coincide with those of Dz,, with a = R 
and b= F. 


The relation a” = 1 disappears, and we have (a,b | 6? = 1,bab-! = a~+). 


Solutions Chapters 6—10 


Section 6.1 


1. 


We have r, = 2, r2 = 1, r3 = 1 so t) = 1, te = 0, t3 = 1. The algorithm terminates in 
one step after after subtraction of (X1 + X2 + X3)(X1X2X3). The given polynomial 
can be expressed as €1€3. 


. We have r; 2,72 1,73 0 so ty 1, te 1, t3 0. At step 1, subtract 


(Xy + Xo + X3)(X1X2 + X1X3 + X2X3). The result is —3X1X2X3 + 4X, X9X3 = 
X,X2X3. By inspection (or by a second step of the algorithm), the given polynomial 
can be expressed as €1€2 + €3. 


. Equation (1) follows upon taking o1(h) outside the summation and using the linear 


dependence. Equation (2) is also a consequence of the linear dependence, because 
oi(h)oi(g) = oi(hg). 


. By hypothesis, the characters are distinct, so for some h € G we have o1(h) 4 o2(h). 


Thus in (3), each a; is nonzero and 


This contradicts the minimality of r. (Note that the 7 = 2 case is important, since 
there is no contradiction if 01(h) — o;(h) = 0 for all i.) 


. By (3.5.10), the Galois group consists of the identity alone. Since the identity fixes all 


elements, the fixed field of G is Q(*/2). 


. Since C = R{?], an R-automorphism o of C is determined by its action on i. Since 


must permute the roots of X? + 1 by (3.5.1), we have o(i) =i or —i. Thus the Galois 
group has two elements, the identity automorphism and complex conjugation. 


. The complex number z is fixed by complex conjugation if and only if z is real, so the 


fixed field is R. 


Section 6.2 


1. 


The right side is a subset of the left since both FE; and EP 41 are contained in £4). Since 
E; is contained in the set on the right, it is enough to show that aj41 € E;(E?,,). By 


1 


hypothesis, a;41 is separable over F’, hence over E;(a?,,). By Section 3.4, Problem 3, 
wisi € Ela OF 11) C Bi (ER.4): 


2. Apply Section 3.4, Problem 7, with E = F(E?) replaced by Ej4,; = E,(E?,,), to 
conclude that Ej, is separable over FE. By the induction hypothesis, FE; is separable 
over F'. By transitivity of separable extensions (Section 3.4, Problem 8), Fj41 is 
separable over F’. By induction, E'/F is separable. 


3. Let f; be the minimal polynomial of a; over F. Then F is a splitting field for f = 
fi--+ fn over F, and the result follows. 


4. This is a corollary of part 2 of the fundamental theorem, with F replaced by K;_; and 
G replaced by Gal(E'/Kj-1) = Ay. 

5. E(A) is a field containing FE > F and A, hence F(A) contains FE and K, so that by 
definition of composite, EK < F(A). But any field (in particular EK) that contains 
E and K contains E and A, hence contains E(A). Thus E(A) < EK. 

6. If o € G, define U(c)(T(x)) = ro(x),  € E. Then w(o) € G’. [If y =7(a) € F’ with 
xz € F, then U(o)y = V(o)ra = ro(x) = T(x) = y.] Now U(o102)T(x) = To109(x) and 
W(o1)V(o2)7T(4) = V(o1)T02(x) = To 02(x), so V is a group homomorphism. The 
inverse of W is given by W’(o’)t~ly = t~1a'(y), o’ € G’, y € E’. To see this, we 
compute 


W'(U(o))r*y = FW (o)y = 7 W(o)ra = rr 9(2) = o(2) = o(7"y). 


Thus W’W is the identity on G. 


7. Since H’ is a normal subgroup of G, its fixed field L = F (H’) is normal over F,, so 
by minimality of the normal closure, we have N C L. But all fixed fields are subfields 
of N, so L CN, and consequently L = N. 


8. If o € H’, then o fixes everything in the fixed field N, so o is the identity. Thus the 
largest normal subgroup of G that is contained in H is trivial. But this largest normal 
subgroup is the core of H in G, and the resulting formula follows from Problems 4 
and 5 of Section 5.1. 


Section 6.3 


1. G= {o1,...,0n} where o; is the unique F-automorphism of F that takes a to a;. 


2. We must find an a such that 1,a,...,a"~1 is a basis for E/Q. Ifa= bis feet 
bnXn, We can compute the various powers of a and write a’ = cj 1%1 +--+ + CGintn, 
i= 0,1,...,n —1, where each c;; is a rational number. The powers of a will form 
a basis iff det|ci,] e: 0. This will happen “with probability 1”; if a particular choice 
of the 6; yields det[c;;] = 0, a slight perturbation of the 6; will produce a nonzero 


determinant. 

3. By (6.3.1), we may regard G as a group of permutations of the roots a1,...,Q@» of f, 
and therefore G' is isomorphic to a subgroup H of S,. Since G acts transitively on 
the a; (see (6.3.1)), the natural action of H on {1,2,...,n} is transitive. [For an 


earlier appearance of the natural action, see the discussion before (5.3.1).] 


10. 


11. 


. The Galois group G must be isomorphic to a transitive subgroup of S2, which is cyclic 


of order 2. There is only one transitive subgroup of S2, namely S2 itself, so G is a 
cyclic group of order 2. 


. Since [Q(V2) : Q = 2 and [Q(V2, V3) : Q(V2)] = 2, the Galois group G has order 4. 


[Note that V3 ¢ Q(/2) because a+ by2 can never be V3 for a,b € Q.] An automor- 
phism o in G must take 2 to V2 and V3 to +V3. Thus a is either the identity or 
has order 2. Now a group in which every element has order 1 or 2 must be abelian, 
regardless of the size of the group [(ab)(ab) = 1, so ab = b~'a~! = ba]. Since G is not 
cyclic, it must be isomorphic to the four group Zz © Zg. (See the analysis in (4.6.4).) 


. Let H be the subgroup generated by H, and Ho, that is, by H, UH2. Ifo © Hi UA, 


then o fixes K, 9 Kg = K. Since H consists of all finite products (= compositions) 
of elements in H, or Hz, everything in H fixes K, so that K C F(H). On the other 
hand, if « € F(H) but « ¢ K, say « ¢ K,. Then some 7 € H, C FA fails to fix x, so 
x ¢ F(H), a contradiction. Therefore kK = F(H). 


. The fixed field is K,K2, the composite of K, and Kg. For if o fixes K, Ko, then 


it fixes both Ky, and Kg, so o belongs to Hy M Hy. Conversely, if o € H,M Ao, 
then o is the identity on both kK, and Ky. But by the explicit form of KK» (see 
Section 3.1, Problem 1 and Section 6.2, Problem 5), o is the identity on Ky K2. Thus 
F(A, 0 He) = ki Ko. 


. We have E = F(ay,...,Q@,), where the a; are the roots of f. Since min(a;, F’) divides 


the separable polynomial f, each a; is separable over F’. By Section 6.2, Problem 1, 
E is separable over F’. 


. Since [Q(A,2) : Q = [Q(A) : QO[Q(O, 7) : Q(P)] = 4 x 2 = 8, we have |G| = 8. Any 


o € G must map 6 to a root of f (4 choices), and i to a root of X? +1 (2 choices, 7 
or —1). Since o is determined by its action on @ and 7, we have found all 8 members 
of G. 


Let o(0) = i0, o(i) = i, and let r(0) = 0, T(t) = —i. Then o4 = 1, 7? = 1, and the 
automorphisms 1, 0,07, 0°,7,07T, 077,07 are distinct (by direct verification). Also, 
we have ot = ta~! = Ta?. The result follows from the analysis of the dihedral group 


in Section 5.8. 

By direct verification, every member of N fixes i0? = iv/2. Since N has index 2 in G, 
the fixed field of N has degree 2 over Q. But the minimal polynomial of i/2 over Q 
is X? + 2, and it follows that F(N) = Q(iV2}. F(N) is the splitting field of X? + 2 
over Q and is therefore normal over Q, as predicted by Galois theory. 


Section 6.4 


1. 


We have a* = 1+ a+ a? + a? and a® = 1. Thus the powers of a do not exhaust the 
nonzero elements of GF'(16). 


. We may assume that E = GF(p”) and that E contains F = GF(p™), where n = md. 


Then [E: F] = [E: F,|/[F : Fp] = n/m =d. Since E/F is separable, we have 
E = F(a) by the theorem of the primitive element. The minimal polynomial of a 
over F' is an irreducible polynomial of degree d. 


10. 


. Exactly as in (6.4.5), carry out a long division of X" — 1 by X™—1. The division 


will be successful iff m divides n. 


. Since the b; belong to L, we have K C L, and since h € L[X], it follows that g|h. But 


g € K[X] by definition of kK, so h|g. Since g and h are monic, they must be equal. 
In particular, they have the same degree, so [EF : L] =[E: K]. Since K C L, we have 
L=K. 


. Since L = K, L is completely determined by g. But if f = min(a, F), then g divides f. 


Since f has only finitely many irreducible divisors, there can only be finitely many 
intermediate fields L. 


. Since there are finitely many intermediate fields between FE and F’, the same is true 


between LZ and F. By induction hypothesis, L = F(@) for some G6 € L. Thus 
E = L(an) = F(B,0n). 


. By hypothesis, there are only finitely many fields of the form F(cG+a,), c€ F. But 


there are infinitely many choices of c, and the result follows. 


. Since EF = F(G,a,), it suffices to show that G € F(cG+a,). This holds because 


(cB + an) — (dB + an) 
p= 
c—d 


. Let o: F — F be the Frobenius automorphism, given by o(a) = x”. Let f = 


min(a,F,) and g = min(a?,F,). Then f(a?) = f(o(a)) = o(f(a)) since o is a 
monomorphism, and o(f(a@)) = o(0) = 0. Thus g divides the monic irreducible 
polynomial f, so g = f. 

By Problem 9, the subsets are {0}, {1,3, 9}, {2,6,5}, {4, 12,10}, and {7,8,11}. [For 
example, starting with 2, we have 2x 3=6,6x 3=18=5mod13,5x3=15= 
2 mod 13.] In the second case, we get 


{0}, {1, 2,4, 8}, {3,6,9, 12}, {5, 10}, {7, 14, 13, 11}. 


Section 6.5 


1. 


W(X?) = J] ,(X? —w;) where the w; are the primitive n” roots of unity. But the roots 
of X? —w, are the pth roots of w;, which must be primitive np'” roots of unity because 
p is prime and p divides n. The result follows. (The map 6 — @? is a bijection between 
primitive np” roots of unity and primitive n“” roots of unity, because y(np) = py(n).) 


. By (6.5.1) and (6.5.6), the Galois group of the n“” cyclotomic extension of Q can be 


identified with the group of automorphisms of the cyclic group of n*” roots of unity. 
By (6.5.6), the Galois group is isomorphic to U,,, and the result follows. 


3. The powers of 3 mod 7 are 3, 9 = 2, 6, 18 = 4, 12=5, 1. 
4. This follows from Problem 3 and (1.1.4). 


5. og(wtw®) =w® +476 =wtw®, sowt+w® € K. Nowwtw® =w+w! = 2cos2z/7, 


so w satisfies a quadratic equation over Q(cos 27/7). By (3.1.9), 
(Q7: Q =[Q7: K|[K : Q(cos 27/7)|[Q(cos 27/7) : Q) 


where the term on the left is 6, the first term on the right is |(o¢)| = 2, and the second 
term on the right is (by the above remarks) 1 or 2. But [K : Q(cos27/7)] cannot be 2 
(since 6 is not a multiple of 4), so we must have kK = Q(cos 27/7). 

6. gow tw? + wt) = ot t+uttw® = wt+o*%+ 04, sow t+ w*?+u* € ZL; 
o3(wt+w?+w4)=u% +u8 +o =u% +w? +u8 Awt+u%+u4, sow+w*+utEéQ. 
[If w? +w° + w® =w+w?+w*, then we have two distinct monic polynomials of degree 
6 satisfied by w (the other is V7(X)), which is impossible.] 

7. By the fundamental theorem, [LZ : Q] = [G: (o2)| = 2, so we must have L = Q(w+ 
w? + w*), 


8. The roots of Vy are the p’th roots of unity that are not pth roots of unity. Thus 


XP -1 1 


W(X) = xr _ 1. td 


and the result follows. 
9. By Problem 1, 


Wig(X) = Wig)6)(X) = Wo(X?) = X°— XP 41. 


Section 6.6 


1. f is irreducible by Eisenstein, and the Galois group is S3. This follows from (6.6.7) or 
via the discriminant criterion of (6.6.3); we have D(f) = —27(4) = —108, which is not 
a square in Q. 

2. f is irreducible by the rational root test, and D(f) = —4(—3)° — 27 = 108 — 27 = 81, 
a square in Q. Thus the Galois group is A3. 

3. f is irreducible by Eisenstein. The derivative is f’/(X) = 5X*—40X° =5X3(X — 8). 
We have f’(z) positive for x < 0 and for x > 8, and f’(x) negative for 0 < x < 8. Since 
f(0) > 0 and f(8) < 0, graphing techniques from calculus show that f has exactly 3 
real roots. By (6.6.7), G = Ss. 


4. f is irreducible by the rational root test. By the formula for the discriminant of a 
general cubic with a = 3,b = —2,c = 1, we have D = 9(—8) — 4(—8) — 27 — 18(6) = 
—175. Alternatively, if we replace X by X — 3 = X —1, the resulting polynomial is 
g(X) = X3 —5X +5, whose discriminant is —4(—5)? — 27(25) = —175. In any event, 
D is not a square in Q, so G = S3. (Notice also that g is irreducible by Eisenstein, so 
we could have avoided the rational root test at the beginning.) 


5. If f is reducible, then it is the product of a linear factor and a quadratic polynomial g. 
If g is irreducible, then G is cyclic of order 2 (Section 6.3, Problem 4). If g is reducible, 
then all roots of f are in the base field, and G is trivial. 


6. Let the roots be a,b + ic and b—ic. Then 
A = (a—b—ic)(a—b + ic)2ic = ((a — b)? + c?)2ic 


and since i? = —1, we have D < 0. Since D cannot be a square in Q, the Galois group 
is S3. [This also follows from (6.6.7).] 


7. 


If the roots are a,b and c, then D = (a — b)?(a — c)?(b—c)? > 0. The result follows 
from (6.6.3). 


Section 6.7 


1. 


10. 


11. 


12. 


13. 


. Each root of g is of the form w*#, so gg = w 


By (6.7.2), the Galois group of Q(,/m)/Q is Zz for m = 2,3,5,7. It follows that the 
Galois group of Q(V2, V3, V5, V7)/Q is Zz xX Zp X Zp X Z. See (6.7.5), and note that 
Q contains a primitive square root of unity, namely —1. (It is not so easy to prove 
that the Galois group has order 16. One approach is via the texhnique of Section 7.3, 
Problems 9 and 10.) 


. Yes. Let E be the p*” cyclotomic extension of Q, where p is prime. If p > 2, then 
Mu 


Q does not contain a primitive p’” root of unity. By (6.5.6), the Galois group is 
isomorphic to the group of units modp, which is cyclic. 


. Since the derivative of X" — a is nX"~! # 0, it follows as in (6.5.1) that f has n 


distinct roots (1,...,8n in E. Since BP = a and B-" = a7, there are n distinct 
n'” roots of unity in E, namely 1 = 6,6, ', G26, ',.-.,GnG_'. Since the group of n*” 
roots of unity is cyclic, there must be a primitive n“” root of unity in E. 

kod for some k. Since w? = 1, we have 
= 6¢?. But c = 6? since @ is also a root of f, and the result follows. 


. By Problem 4 we have 


Cc cl C24 bp gg? (gac?)? 


with géc? € F. Thus géc? is a root of f in F. 


. [E:: F(w)] divides p and is less than p by (6.7.2); note that E is also a splitting field 


for f over F(w). Thus [E: F(w)] must be 1, so E = F(w). 


. F contains a primitive p“” root of unity w iff E(= F(w)) = F iff X? —c splits over F. 
. By induction, o/(0) = 6+ 7,0 < j < p—1. Thus the subgroup of G that fixes 6, hence 


fixes F'(0), consists only of the identity. By the fundamental theorem, F = F(6). 


. We have a(6? — @) = a(6)? — o(8) = (8+ 1)? — (0+ 1) = & — @ in characteristic p. 


Thus 6? — 6 belongs to the fixed field of G, which is F. Let a = 6? — 0, and the result 
follows. 

Since f(@) = 0, min(0, F) divides f. But the degree of the minimal polynomial is 
[F(0): F] = |E: F]) =p=deg f. Thus f = min(@, F), which is irreducible. 

Since 6°—6 = a, we have (0+1)?—(0+1) = 6°—6 = a. Inductively, 6,0+1,...,6+p—1 
are distinct roots of f in E, and since f has degree p, we have found all the roots and 
f is separable. Since EF is a splitting field for f over F', we have E = F(0). 


By Problem 11, every root of f generates the same extension of F', namely EF. But 
any monic irreducible factor of f is the minimal polynomial of at least one of the 
roots of f, and the result follows. 

[E : F] = [F (6) : F] = deg(min(0, F’)) = deg f = p. Thus the Galois group has prime 
order p and is therefore cyclic. 


Section 6.8 


1. 


10. 


Take the real part of each term of the identity to get 
cos 30 = cos? 6 + 3cos 0(isin 0)? = cos? @ — 3cos @(1 — cos? 8); 
thus cos 36 = 4cos? 6 — 3cos 6. If 39 = 7/3, we have 
cos 1/3 = 1/2 = 4a? — 3a 


so 8a° —6a—1=0. But 8X? —6X —1 is irreducible over Q (rational root test), so 
a is algebraic over Q and [Q(qa) : Q] = 3 (not a power of 2), a contradiction. 


. X3 — 2 is irreducible by Eisenstein, so [Q(W/2) : Q| =3 and V2 is not constructible. 
. The side of such a square would be ./z, so \/7, hence 7, would be algebraic over Q, 


a contradiction. 


.w is a root of X? — 2(cos2r/n)X + 1 since cos2r/n = $(w + w') and 


w* —(w+w tjw+1=0. The discriminant of the quadratic polynomial is nega- 


tive, proving irreducibility over R D Q(cos27/n). 


. By (6.5.2), (6.5.5) and (3.1.9), 


p(n) = [Q() : Q| = [Qw) : Q(cos 21/n)|[Q(cos 2x /n) : Q. 


By Problem 4, [Q(w) : Q(cos27/n)] = 2, and if the regular n-gon is constructible, 
then [Q(cos 27/n) : Q| is a power of 2. The result follows. 


. By hypothesis, G = Gal(Q(w)/Q) is a 2-group since its order is y(n). Therefore 


every quotient group of G, in particular Gal(Q(cos27/n)/Q), is a 2-group. [Note 
that by (6.5.1), G is abelian, hence every subgroup of G is normal, and therefore 
every intermediate field is a Galois extension of Q. Thus part 2c of the fundamental 
theorem (6.2.1) applies.| 


. By the fundamental theorem (specifically, by Section 6.2, Problem 4), there are fields 


Q=Kk) < Ki <--- < K, = Q(cos2r/n) with [K; : Kj-1] = 2 for all i = 1,...,r. 
Thus cos 27/n is constructible. 


. Ifnm=py'--- ps, then (see Section 1.1, Problem 13) 


e,—1 


y(n) = py "(pi — 1) ++ pe" (pr — 1). 


If p; #4 2, we must have e; = 1, and in addition, p; — 1 must be a power of 2. The 
result follows. 


. If mis not a power of 2, then m can be factored as ab where a is odd and 1<b<™m. 


In the quotient (X* +1)/(X +1), set X = 2°. It follows that (2” +1)/(2°4+ 1) is an 
integer. Since 1 < 2?+1<2™+1, 2™+1 cannot be prime. 


The e; belong to FE and are algebraically independent over K, so the transcendence 
detree of EF over K is at least n. It follows that the a; are algebraically independent 
over Ky, and the transcendence degree is exactly n. Therefore any permutation of 
the a; induces a K-automorphism of F = K(aj,...,Q@n) which fixes each e;, hence 
fixes F. Thus the Galois group of f consists of all permutations of n letters. 


11. Since S;, is not solvable, the general equation of degree n is not solvable by radicals 


if n > 5. In other words, if n > 5, there is no sequence of operations on €1,...,€n 
involving addition, subtraction, multiplication, division and extraction of m*” roots, 
that will yield the roots of f. 


Section 6.9 


1. 


If S is not maximal, keep adding elements to S until a maximal algebraically indepen- 
dent set is obtained. If we go all the way to T, then T is algebraically independent and 
spans F/ algebraically, hence is a transcendence basis. (Transfinite induction supplies 
the formal details.) 


. For the first statement, take T = E in Problem 1. For the second statement, take 


S=9. 

(i) implies (ii): Suppose that t; satisfies f(t;) = bo + bit; +--+ + bmt?” = 0, with 
b; € F(T \ {ti}). By forming a common denominator for the b;, we 
may assume that the b; are polynomials in F[T \ {t;}] C F[Z]. By (i), 
b; = 0 for all 7, so f =0. 

(ii) implies (iii): Note that F(t1,...,t:-1) C F(T \ {ti}). 

(iii) implies (i): Suppose that f is a nonzero polynomial in F[X1,...,Xm] such that 
f(t1,---,tm) = 0, where m is as small as possible. Then f = hg + 
hiXm +--+: +h-X}, where the h; belong to F[X1,...,Xm-—i]. Now 


f(ti, see itm) = bo + bitm teeet bth where b; = hj (ti, sae wtm—1)- 
If the b; are not all zero, then t,, is algebraic over F(t1,...,tm—1), 
contradicting (iii). Thus b; = 0, so by minimality of m, h; = 0, 
so f =0. 


. If SU {t} is algebraically dependent over F', then there is a positive integer n and 


a nonzero polynomial f in F[X1,...,Xn,Z] such that f(ti,...,tn,t) = 0 for some 
ti,..-,tn € S. Since S is algebraically independent over F’, f must involve Z. We 
may write f = bob +b1:Z +--+: +bmZ™ where bm # 0 and the b; are polynomials in 
F([X1,...,Xn]. But then t is algebraic over F'(S). 

Conversely, if ¢ is algebraic over F(S), then for some positive integer n, there are 
elements t),...,t, € S such that t is algebraic over F(t1,...,tn). By Problem 3, 
{t1,...,tn,t} is algebraically dependent over F’, hence so is SU {t}. 


. Let A= {81,...,8m,t1,---,tn} be an arbitrary finite subset of SUT, with s; € S and 


t; € T. By Problem 3, s; is transcendental over F(s1,...,s;-1) and t; is transcendental 
over K(ti,...,t;-1), hence over F(s1,...,5m,ti,...,tj-1) since S C K. Again by 
Problem 3, A is algebraically independent over F’. Since A is arbitrary, S U T is 
algebraically independent over F. Now if t € K then {t} is algebraically dependent 
over K (t is a root of X —t). But if t also belongs to T, then T is algebraically 
dependent over K, contradicting the hypothesis. Thus K and T, hence S$ and T, are 
disjoint. 


. By Problem 5, SUT is algebraically independent over F’. By hypothesis, FE is algebraic 


over K(T) and K is algebraic over F(S). Since each t € T is algebraic over F'(S')(T) = 


F(SUT), it follows that K(T) is algebraic over F(.S UT). By (3.3.5), E is algebraic 
over F(S UT). Therefore SU T is a transcendence basis for E/F’. 


. If T is algebraically independent over F’, the map f(X1,...,Xn) > f(ti,---,tn) ex- 


tends to an F-isomorphism of F'(X1,...,X») and F(ti,...,tn). Conversely, assume 
that F(T) is F-isomorphic to the rational function field. By Problem 2, there is a tran- 
scendence basis B for F(T)/F such that B C T. By (6.9.7), the transcendence degree 
of F(T)/F is |T| =n. By (6.9.5) or (6.9.6), B = T, so T is algebraically independent 
over F’. 


. The “if” part is clear since [K(z) : K] can’t be finite; if so, [F': K] < oo. For the “only 


if” part, z is algebraic over K(x), so let 
2” + Pn_1(a)2"7* +++ + Yo(z) =0, — € K(z). 


Clear denominators to get a polynomial f(z,x) = 0, with coefficients of f in K. Now 
x must appear in f, otherwise z is not transcendental. Thus x is algebraic over K(z), 
so [K(z,2) : K(z)] < co. Therefore 

[FP K(2)] = [F: K(z,2)|[K (2,2) : K(). 


The first term on the right is finite since K(x) C K(z,), and the second term is finite, 
as we have just seen. Thus [F’: K(z)] < oo, and the result follows. d& 


. We have trdeg(C/Q) = c, the cardinality of C (or R). For if C has a countable 


transcendence basis 21, 22,... over Q, then C is algebraic over Q(21, 22,...). Since a 
polynomial over Q can be identified with a finite sequence of rationals, it follows that 
|C| = |Q|, a contradiction. 


Section 7.1 


1. 


Replace (iii) by (iv) and the proof goes through as before. If R is a field, then in (iii) 
implies (i), « is an eigenvalue of C, so det(xI — C) = 0. 


. Replace (iii) by (v) and the proof goes through as before. [Since B is an A[a]-module, 


in (iii) implies (i) we have 73; € B; when we obtain [det(#J —C)]b = 0 for every b € B, 
the hypothesis that B is faithful yields det(aI — C) = 0.] 


. Multiply the equation by a”~! to get 


aqot= =(Cp-1 tees + cyan? + cpa” +) eA. 


. Since A[b] is a subring of B, it is an integral domain. Thus if bz = 0 and b ¥ 0, 


then z = 0. 


. Any linear transformation on a finite-dimensional vector space is injective iff it is 


surjective. Thus if b € B and b £0, there is an element c € A[b] C B such that be = 1. 
Therefore B is a field. 


. P is the preimage of Q under the inclusion map of A into B, so P is a prime ideal. 


The map a+ P > a+Q is a well-defined injection of A/P into B/Q, since P= QN A. 
Thus A/P can be viewed as a subring of B/Q. 
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7. 


If b+ Q € B/Q, then 6 satisfies an equation of the form 
a" + Gn—10" 1 +---+a1¢+a9 =0, a € A. 


By Problem 6, b+ Q satisfies the same equation with a; replaced by a; + P for all 7. 
Thus B/Q is integral over A/P. 


. By Problems 3-5, A/P is a field if and only if B/Q is a field, and the result follows. 


(Note that since Q is a prime ideal, B/Q is an integral domain, as required in the 
hypothesis of the result just quoted.) 


Section 7.2 


1. 


By the quadratic formula, L = Q(Vb?—4c). Since b? — 4c € Q, we may write 
b? — 4c = s/t = st/t? for relatively prime integers s and t. We also have s = uy? 
and t = vz? where u,v,y,z € Z, with u and v relatively prime and square-free. Thus 


L = Qu) = Q(va). 


. If Q(Vd) = Qe), then Vd = a+ bie for rational numbers a and b. Thus d = 


a? + b?e + 2aby/e, so \/e is rational, a contradiction (unless a = 0 and b = 1). 


. Any isomorphism of Q(Vd) and Q(,/e) must carry Vd into a+by/ for rational numbers 


a and b. Thus d is mapped to a? + b?e + 2ab\/e. But a Q-isomorphism maps d to d, 
and we reach a contradiction as in Problem 2. 


. Since wy = w3,, we have wn € Q(wen), 80 Q(wn) C Q(wan). If n is odd then n+1 = 2r, 


sO 


Therefore Q(wan) C Q(wn). 


. Let f be a monic polynomial over Z with f(x) = 0. If f is factorable over Q, then it is 


factorable over Z by (2.9.2). Thus min(z, Q) is the monic polynomial in Z[X] of least 
degree such that f(x) = 0. 


. Q(/—3) = Q(w) where w = —4 + $/—3 is a primitive cube root of unity. 


. Ifn =[L: Q], then an integral basis consists of n elements of L that are linearly inde- 


pendent over Z, hence over Q. (A linear dependence relation over Q can be converted 
to one over Z by multiplying by a common denominator.) 


Section 7.3 


1. The Galois group of E/Q consists of the identity and the automorphism o(a+bVd) = 


a — bVd. By (7.3.6), T(z) = «+ (x) = 2a and N(a) = xo(x) = a? + db?. 


2. A basis for E/Q is 1,0,07, and 


1-62, 69=63=36-1, 676? = 64 = 96° = 367 8. 
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Thus 
Be -0 - 29 
mé?)=|0 3 —1 
1 0 8 


and we have T(6”) = 6, N(0?) = 1. Note that if we had already computed the norm 
of 6 (the matrix of @ is 


0 0 -1 
mé=|1 0 3 
0 1 O 
and T(0) = 0, N(@) = —1), it would be easier to calculate N(6?) as [N(0)|? = 
(1)? =1. 


. The cyclotomic polynomial Wg has only two roots, w and its complex conjugate W. 
By (7.3.5), 


T(w) =w+o= el + e—*7/3 — 2c08(1/3) = 1. 


. By (7.3.6), N(x) = xo(ax)---o" +(x) and T(z) = x+o(x) +--- +0" (a). If 
e = y/o(y), then o(x) = o(y)/o7(y),...,0° (x) = a" “*(y)/o"(y) = 0" *(y)/y, 
and the telescoping effect gives N(x) = 1. If « = z— o(z), then o(x) = o(z) — 0? (z), 


.,0" "(x) =o" 1(z) — z, and a similar telescoping effect gives T(x 
. Choose v € FE such that 


y =v+20(v) + 20(x)o?(v) +-+++ 20(x)---0"-2(x)a" "1 (v) £0 
and hence 
a(y) = o(v) + o(x)07(v) + o(x)0?(x)o3(v) +--+ + 0(x)07(x2)+--0" 1 (x)a"(v). 


We are assuming that N(x) = xo(x)---o"~1(x) = 1, and it follows that the last 
summand in o(y) is 2~to”(v) = x~!v. Comparing the expressions for y and o(y), we 


have xo(y) = y, as desired. 


. Since T(x) = 0, we have —x = o(x) +---+0"71(zx), so the last summand of o(w) 
is —cu. Thus 


. We have 


since T(u) belongs to F' and is therefore fixed by o (see (7.3.3)). By Problem 6, 
z—o(z) =. 
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10. 


. No. We have y/o(y) = y//a(y’) iff y/y’ = o(y/y’) iff y/y’ is fixed by all automor- 


phisms in the Galois group G iff y/y’ belongs to the fixed field of G, which is F. 
Similarly, z — o(z) = 2! — o(2’) iff z-— 2 € F. 


. We have min(#,Q) = X* — 2, min(6?,Q) = X? — 2, min(6?,Q) = X* - 8, 


min(V/30,Q) = X4— 18. (To compute the last two minimal polynomials, note that 
(9°)* = (64)3 = 23 =8, and (/36)* = 18.) Therefore, all 4 traces are 0. 

Suppose that /3 = a+b0+c6? + dé? with a,b,c € Q. Take the trace of both sides to 
conclude that a = 0. (The trace of V3 is 0 because its minimal polynomial is X?— 3.) 
Thus V3 = b0+c6? +d0°, so 30 = b6? + ch3 42d. Again take the trace of both sides 
to get d= 0. The same technique yields b = c = 0, and we reach a contradiction. 


Section 7.4 


1. 


If I(y) = 0, then (a, y) = 0 for all x. Since the bilinear form is nondegenerate, we 
must have y = 0. 


. Since V and V* have the same dimension n, the map y — I(y) is surjective. 


. We have (2, y;) = l(y;)(i) = f;(xi) = dij. Since the f; = I(y;) form a basis, so do 


the Yj. 


. Write x; = Sd aikYx, and take the inner product of both sides with x; to conclude 


that aij = (x4, 23). 


. The “if” part was done in the proof of (7.4.10). If det C = +1, then C7! has coeffi- 


cients in Z by Cramer’s rule. 


. If d#1 mod 4, then by (7.2.3), 1 and Vd form an integral basis. Since the trace of 


a + bV 4d is 2a (Section 7.3, Problem 1), the field discriminant is 


2 0 


D=aet | od 


| =4a 
If d= 1 mod 4, then 1 and $(1+ Vd) form an integral basis, and 


[50+ va) =i+ft pve 


Thus 


D =det fi is1| agi 
I 


. The first statement follows because multiplication of each element of a group G by a 


particular element g € G permutes the elements of G. The plus and minus signs are 
balanced in P+ N and PN, before and after permutation. We can work in a Galois 
extension of Q containing L, and each automorphism in the Galois group restricts to 
one of the o; on L. Thus P+ N and PN belong to the fixed field of the Galois group, 
which is Q. 


10. 


11. 


12. 


13. 
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. Since the x; are algebraic integers, so are the o;(x;), as in the proof of (7.3.10). By 


(7.1.5), P and N, hence P+ N and PN, are algebraic integers. By (7.1.7), Z is 
integrally closed, so by Problem 7, P+ N and PN belong to Z. 


. D=(P-—N)?=(P+N)?—4PN =(P+N)? mod 4. But any square is congruent 


to 0 or 1 mod 4, and the result follows. 


We have y; = Dt aijz; with aj; € Z. By (7.4.3), D(y) = (det A)? D(z). Since D(y) 
is square-free, det A = +1, so A has an inverse with entries in Z. Thus x = A~'y, as 
claimed. 


Every algebraic integer is a Z-linear combination of the x;, hence of the y; by Prob- 
lem 10. Since the y; form a basis for LZ over Q, they are linearly independent and the 
result follows. 

No. For example, let L = Q(Vd), where d is a square-free integer with d # 1 mod 4. 
(See Problem 6). The field discriminant is 4d, which is not square-free. 


This follows from the proof of (7.4.7). 


Section 7.5 


P Ag C Ay C Ag C=: 
. Let a/p” € B, where p does not divide a. There are integers r and s such that 


ra+ sp” = 1. Thus ra/p” = 1/p” in Q/Z, and A, C B. If there is no upper bound 
on n, then 1/p” € B for all n (note 1/p” = p/p"*! = p?/p"*?, etc.), hence B = A. If 
there is a largest n, then for every m > n, BN Am © An by maximality of n. Therefore 
B= Ay. 


. Let £1,%2,... be a basis for V. Let M,. be the subspace spanned by 21,...,2,, and 


L, the subspace spanned by the z;,j > r. If V is n-dimensional, then V = Lo > Ly > 
--+ > In, > Ly = 0 is a composition series since a one-dimensional subspace is a 
simple module. [V = M, > Mn-1 > .--- > Mi > 0 is another composition series.] 
Thus V is Noetherian and Artinian. If V is infinite-dimensional, then M,; < Mj <.-:-- 
violates the acc, and Ip > Ly > Lg > --- violates the dec. Thus V is neither 
Noetherian nor Artinian. [Note that if V has an uncountable basis, there is no problem; 
just take a countably infinite subset of it.] 


. 1(M) is finite iff MM has a composition series iff M is Noetherian and Artinian iff NV 


and M/N are Noetherian and Artinian iff I(N) and [(M/N) are finite. 


. By Problem 4, the result holds when /(M) = oo, so assume I(M), hence /(.N) and 


I(M/N), finite. Let 0 << Ny <--. < N, = WN be a composition series for N, and let 
N/N <(M,4+ N)/N <---<(M,+ N)/N = M/N be a composition series for M/N. 
Then 


0O<N, <-:--<N<M,4+N<---< M+ N=M 


is a composition series for M. (The factors in the second part of the series are simple 
by the third isomorphism theorem.) It follows that (M7) =r+s=I(N)+l(M/N). 
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. By (7.5.9), R is a Noetherian S-module, hence a Noetherian R-module. (Any R- 


submodule T of FR is, in particular, an S-submodule of R. Therefore T is finitely 
generated.) 


. Yes. Map a polynomial to its constant term and apply the first isomorphism theorem 


to show that R = R[X]/(X). Thus R is a quotient of a Noetherian R-module, so is 
Noetherian by (7.5.7). 


. If there is an infinite descending chain of submodules M; of M, then the intersection 


N =1,M; cannot be expressed as the intersection of finitely many M;. By the corre- 
spondence theorem, N;(M;/N) = 0, but no finite intersection of the submodules M;/N 
of M/N is 0. Thus M/N is not finitely cogenerated. Conversely, suppose that M/N is 
not finitely cogenerated. By the correspondence theorem, we have 1,.M, = N, but no 
finite intersection of the M, is N. Pick any My and call it M,. If M, C Mg for all a, 
then M, = N, a contradiction. Thus we can find M, = M2 such that M; > M,N Mo. 
Continue inductively to produce an infinite descending chain. 


Section 7.6 


1. 


The “only if” part follows from (7.6.2). If the given condition is satisfied and ab € P, 
then (a)(b) C P, hence (a) C P or (b) C P, and the result follows. 


. If x; ¢ P; for some i, then x; € J \ Uf_, P; and we are finished. 
. Since I is an ideal, « € I. Say x € P,. All terms in the sum that involve x; belong 


to P, by Problem 2. The remaining term x2--- 2, is the difference of two elements in 
P,, hence r2:+-%, € P,. Since P; is prime, x; € P; for some j ¥ 1, contradicting the 
choice of x;. 


. The product of ideals is always contained in the intersection. If J and J are relatively 


prime, then l=a2+y witha elandyec J. IfzeInJ, then z=zl=zr+zye lJ. 
The general result follows by induction, along with the computation 


R=(h+I3)d24+ Is) Ch + Is. 


Thus J, J> and Js are relatively prime. 


. See (2.6.9). 
. Assume that R is not a field, equivalently, {0} is not a maximal ideal. Thus by (7.6.9), 


every maximal idea is invertible. 


. Let r be a nonzero element of R such that rk C R, hence K C r-'R C K. Thus 


K =r7!R. Since r~? € K we have r~? = r~'s for some s € R. But thenr~'=s € R, 
so kK C R and consequently K = R. 


. R= R= (P, + Po)” C Pi + Py. Thus Pf and P» are relatively prime for all r > 1. 


Assuming inductively that Pf and P; are relatively prime, we have 
P§ = P§R= P3(PL +P.) CPP+ Ph 
so 


R= Pi + PSC Pr + (P+ Pet) = Pr+ pet 
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completing the induction. 


Section 7.7 


1. 


By Section 7.3, Problem 1, the norms are 6, 6, 4 and 9. Now if ¢ = a+ b/—5 and 
x = yz, then N(x) = a? + 5b? = N(y)N(z). The only algebraic integers of norm 1 
are +1, and there are no algebraic integers of norm 2 or 3. Thus there cannot be a 
nontrivial factorization of 1+ —5, 2 or 3. 


. If (a+b/—5)(c+d/—5) = 1, take norms to get (a? + 5b?)(c? +5d?) = 1, s0ob=d=0, 


a=+l,c= 


. By Problem 2, if two factors are associates, then the quotient of the factors is +1, 


which is impossible. 


. This is a nice application of the principle that divides means contains. The greatest 


common divisor is the smallest ideal containing both J and J, that is, [+ J. The least 
common multiple is the largest ideal contained in both I and J, which is IM J. 


. If I is a fractional ideal, then by (7.7.1) there is a fractional ideal I’ such that II’ = R. 


By definition of fractional ideal, there is a nonzero element r € R such that rl’ is an 
integral ideal. If J = rl’, then IJ = Rr, a principal ideal of R. 


. This is done just as in Problems 1-3, using the factorization 18 = (2)(3?) = (1+ 


v—17)(1 — Y—17). 


. By (7.2.2), the algebraic integers are of the form a + b/—3,a,b € Z, or $+ 3V-3 


2° 2 
where u and v are odd integers. If we require that the norm be 1, we only get +1 in 


the first case. But in the second case, we have u? + 3v? = 4, sou = +1,v = £1. Thus 


if w = e?"/3, the algebraic integers of norm 1 are +1,+w, and +w”. 


Section 7.8 


1. 


If Rx and Ry belong to P(R), then (Rx)(Ry)~! = (Rx)(Ry~') = Rry! € P(R), and 
the result follows from (1.1.2). 


. If C(R) is trivial, then every integral ideal J of R is a principal fractional ideal Rx, x € 


Kk. But I C R, so x = 1x must belong to R, proving that R is a PID. The converse 
holds because every principal ideal is a principal fractional ideal. 


9/1 BER), 86 (yb) yb) SO SPD 
4. Since 2 € Py, it follows that 4 € P?, so by Problem 3, 2= 6-4 € P3. 
5. (2,1+V—5)(2, 1+ V—5) = (4,2(1+ V5), (1+-V—5)?), and (14+-/—3)? = 442-5. 


Thus each of the generators of the ideal P? is divisible by 2, hence belongs to (2). 
Therefore P? C (2). 


. 2?+5 = (x+1)(x-1) mod 3, which suggests that (3) = P3P3, where P3 = (3,1+./—5) 


and Pi = (3,1 — V5). 


. P3 Ps = (8,3(1—V—5), 3(1+ V/—5), 6) C (8) since each generator of P3P3 is divisible by 


3. But 3 € P3M P3, hence 9 € P3P3, and therefore 9-6 = 3 € P3P3. Thus (3) C P3P%, 
and the result follows. 
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Section 7.9 


1. 


Using (1), the product is z = 4+ 2p+ 4p? + p?4 p*. But 4=3+4+1=1+4 5 and 
Ap? = p? + 3p? = p?+p®. Thus z= 14+ 3p +p? + 2p? + pt = 14 2p? + 2p? + pt. 
Using (2), we are multiplying « = {2,5,14,14,...} by y = {2,2,11,11,...}. Thus 
z = 4,21 = 10,22 = 154,23 = 154,24, = 154, and so on. But 4 = 1 mod 3, 10 = 
1 mod 9, 154 = 19 mod 27, 154 = 73 mod 81, 154 = 154 mod 243. The standard form 
is {1,1,19, 73,154, 154,...}. As a check, the product is (2+3+9)(2+9) = 154, whose 
base 3 expansion is 1 + 0(3) + 2(9) + 2(27) + 1(81) as found above. 


. We have ag = —1 and a, = 0 for n > 1; equivalently, x, = —1 for all n. In standard 


form, x9 = p—1,21 = p?—1, 22 = p>—1,....Since (p” —1)—(p"™—!—1) = (p—1)(p"—}), 
the series representation is 


(p—1) + (p—1)p+ (p—1)p? +--+: + (p— 1p" +---. 


The result can also be obtained by multiplying by -1 on each side of the equation 


1=(1—p)(1+p+p74---). 


. Let x be a nonzero element of GF(q). By (6.4.1), 7+ = 1, so |a|{~! = 1. Thus |2| is 


a root of unity, and since absolute values are nonnegative real, we must have |z| = 1, 
and the result follows. 


. If the absolute value is nonarchimedian, then S' is bounded by (7.9.6). If the absolute 


value is archimedian, then by (7.9.6), |n| > 1 for some n. But then |n*| = |n|* — oo 
as k — oo. Therefore S' is unbounded. 


. A field of prime characteristic p has only finitely many integers 0,1,...,p—1. Thus 


the set S of Problem 4 must be bounded, so the absolute value is nonarchimedian. 


. The “only if? part is handled just as in calculus. For the “if” part, note that by (iv) 


of (7.9.5), we have |Zm + 2m41 +++: + Zn| < max{|z;|: m <i<n}—>0as m,n — oc. 
Thus the n*” partial sums form a Cauchy sequence, which must converge to an element 


in Q,. 


. Since n! = 1-2---p--+-2p---3p---, it follows from (7.9.2) and (7.9.3) that if rp <n < 


(r + 1)p, then |n!| = 1/p". Thus |n!| — 0 as n > on. 


. No. Although |p"| = 1/p" > 0 as r — on, all integers n such that rp <n < (r+1)p 


have p-adic absolute value 1, by (7.9.2). Thus the sequence of absolute values |n| 
cannot converge, hence the sequence itself cannot converge. 


Section 8.1 


I. 
2. 


Ifa é€V and f(a) 40, then fo(x) must be 0 since fi fo € I(V); the result follows. 
By Problem 1, V C V(fi) UV(f2). Thus 


V=(VOV(fi))UV OV(f2)) = VU Va. 


Since f; ¢ I(V), there exists  € V such that fi(x) 40. Thus x ¢ Vi, soVi CV; 
similarly, V2 C V. 
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. I(V) D I(W) by (4). If 1(V) = I(W), let V = V(S),W = V(T). Then IV(S) = 


IV(T), and by applying V to both sides, we have V = W by (6). 


. Let « EV; if fi(v) £0, then since fy € 1(Vi), we have x ¢ Vj. But then x € V2, and 


therefore fo(a) = 0 (since fg € I(V2)). Thus fi fo =0 on V, so fifo € I(V). 


. If V is reducible, then V is the union of proper subvarieties V; and V2. If V, is 


reducible, then it too is the union of proper subvarieties. This decomposition process 
must terminate in a finite number of steps, for otherwise by Problems 1—4, there 
would be a strictly increasing infinite sequence of ideals, contradicting the fact that 
k[X1,...,Xp] is Noetherian. 


-HV=U,Yu= U; W;, then V; = U, Vi NW3), so by irreducibility, V; = Vif W; for 


some j. Thus V; C W;, and similarly W; C Vz for some k. But then V; C Vz, hence 
i = k (otherwise we would have discarded V;). Thus each V; can be paired with a 
corresponding W;, and vice versa. 


. By hypothesis, A” = U(A"\V(J;)). Taking complements, we have NV (J;) = 0. But by 


(8.1.2), AV(I;) = V(UL) = V(J), so by the weak Nullstellensatz, I = k[Xq,..., Xn]. 
Thus the constant polynomial 1 belongs to J. 


. Suppose that the open sets A” \ V(J;) cover A”. By Problem 7, 1 € J, hence 1 


belongs to a finite sum )°,.,J;. Since 1 never vanishes, V()0,<p Ji) = 0. By (8.1.2), 
NicrVi = 0, where V; = V(J;). Taking complements, we have Ujer(A” \ V;) = A”. 
Thus the original open covering of A” has a finite subcovering, proving compactness. 


Section 8.2 


1. 


If a ¢ (ai,...,@,), then g or some other element of J would extend the inductive 
process to step k + 1. 


. In going from d; to dj+1 we are taking the minimum of a smaller set. 


. By minimality of m, a ¢ (a1,...,@m_—1), hence f, and g satisfy conditions 1 and 2. 


By choice of fim we have d,, < d. (If m= 1, then d, < d by choice of f;.) 


. Let f be the unique ring homomorphism from R[Xj,...,Xn] to S such that f is the 


identity on R and f(X;) = 2, i= 1,...,n. (For example, if a € R, then aX?X] > 
ax}x{.) Since the image of f contains R and {x1,...,2,}, f is surjective and the result 
follows. 


. By the Hilbert basis theorem, R[X1,..., Xp] is a Noetherian ring, hence a Noetherian 


R[X1,...,Xpn]-module. By (7.5.7), S is a Noetherian R[X1,...,X,]-module. But the 
submodules of S considered as an R[.X1,..., Xn]-module coincide with the submodules 
of S as an S-module. (See Section 4.2, Problems 6 and 7; note that the kernel of the 
homomorphism f of Problem 4 annihilates S.) Thus S' is a Noetherian S-module, that 
is, a Noetherian ring. 


Section 8.3 


1. 


Suppose that cy € J with « € J and y ¢ J. By maximality of J, the ideal J + (2) 
contains an element s € S. Similarly, J+ (y) contains an element t € S. But then 
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st € (J + (x))\(J + (y)) CJ + (xy) C J, 80 SN J #0, a contradiction. 


. Let S = {1, f, f?,...,f7,...}. Then INS = @ since f ¢ VI. By Problem 1, I is 


contained in a prime ideal P disjoint from S. But f € S,so f cannot belong to P, and 
the result follows. 


. The “if”? part follows because f and f" have the same zero-set. Conversely, if 


V(f) = V(g), then by the Nullstellensatz, \/(f) = /(g), and the result follows. 


.W CY since (t*)? = (t?)(t®) and (t5)? = (¢°)?(t*); L C V by direct. verification. 


Conversely, if y? = xz and z? = xy, let t = y/x. (If c = 0, then y = z = 0 and 
we can take t = 0.) Then z = y?/x = (y/x)*x = t?z, and z? = xy. Therefore 
2? = tte? = ay, hence y = t*. If t = 0 then y = 0, hence z = 0 and (2, y,z) € L. 
Thus assume t 4 0. But then x = y/t = t? and z=#r=?°. 


. We will show that I(V) is a prime ideal (see the exercises in Section 8.1). If fg € I(V), 


then fg vanishes on V. Using the parametric form, we have f(t, t?,t?)g(t, t?,t?) = 0 
for all complex numbers t. Since we are now dealing with polynomials in only one 
variable, either f(t,t?,t?) = 0 for all t or g(t,t?,t?) = 0 for all t. Thus f € I(V) 
org E€1(V). 


. (a) «= 2t/(t? +1), y= (# —1)/(t? +1) 


(b) c=. 9se 
(c) c=?t?-1, y=t(t? -1) 


. (Following Shafarevich, Basic Algebraic Geometry, Vol.1, page 2.) We can assume that 


x appears in f with positive degree. Viewing f and g as polynomials in k(y)[2], a PID, 
f is still irreducible because irreducibility over an integral domain implies irreducibility 
over the quotient field. If g = fh where h is a polynomial in x with coefficients in k(y), 
then by clearing denominators we see that f must divide g in k{x,y], a contradiction. 
(Since f is irreducible, it must either divide g or a polynomial in y alone, and the 
latter is impossible because x appears in f.) Thus f does not divide g in k(y)[a]. 
Since f and g are relatively prime, there exist s,t € k(y)[a] such that fs + gt = 1. 
Clearing denominators, we get u,v € k[x,y] such that fu + gv = a, where a isa 
nonzero polynomial in y alone. Now if a,@ € k and f(a,8) = g(a,G@) = 0, then 
a(3) = 0, and this can only happen for finitely many @. For any fixed (3, consider 
f(x, 8) = 0. If this polynomial in x is not identically 0, then there are only finitely 
many a such that f(a, 3) = 0, and we are finished. Thus assume f(x, 3) = 0. Then 
f(x,y) = f(x,y) — f(a, B) = (y — B)h in k(x)[y], contradicting the irreducibility of f. 


Section 8.4 


1 


. Since f = 0 iff some f; = 0, V(f) is the union of the V(f;). Since each f; is irreducible, 


the ideal I; = (f;) is prime by (2.6.1), hence V(J;) = V(f;) is an irreducible subvariety 
of V(f). [See the problems in Section 8.1, along with the Nullstellensatz and the 
fact that every prime ideal is a radical ideal (Section 8.3, Problem 2).] No other 
decomposition is possible, for if V(f;) C V(f;), then (fi) D (fj). This is impossible 
if f; and f; are distinct irreducible factors of f. 


10. 


19 


. By the Nullstellensatz, IV(f) = ./(f), and we claim that \/(f) = (fi---f;). For if 


g € (fi--: f,), then a sufficiently high power of g will belong to (f). Conversely, if 
g™” = hf, then each f; divides g™, and since the f; are irreducible, each f; divides g, 
so (fi --- f,) divides g. 


. By Problem 1, f is irreducible if and only if V(f) is an irreducible hypersurface. If f 


and g are irreducible and V(f) = V(g), then as in Problem 1, (f) = (g), so f = cg 
for some nonzero constant c (Section 2.1, Problem 2). Thus f — V(f) is a bijection 
between irreducible polynomials and irreducible hypersurfaces, if the polynomials f 
and cf,c # 0, are identified. 


. This follows from the definition of I(X) in (8.1.3), and the observation that a function 


vanishes on a union of sets iff it vanishes on each of the sets. 


. By Section 8.1, Problem 5, every variety V is the union of finitely many irreducible 


subvarieties Vi,...,V;. By Problem 4, I(V) = N%_,I(V;). By the Problems in Sec- 
tion 8.1, each I(V;) is a prime ideal. By (8.4.3), every radical ideal is [(V) for some 
variety V, and the result follows. 


. By Section 8.1, Problem 6, and the inclusion-reversing property of I (part (4) of 


(8.1.3)), the decomposition is unique if we discard any prime ideal that properly 
contains another one. In other words, we retain only the minimal prime ideals. 


. If f is any irreducible factor of any of the f;, then f does not divide g. Thus for some 


j #14, f does not divide f;. By Problem 7 of Section 8.3, the simultaneous equations 
f = f; = 0 have only finitely many solutions, and consequently X is a finite set. 


. With notation as in Problem 7, f; = ghi, where the gcd of the h,; is constant. Thus 


X is the union of the algebraic curve defined by g = 0 and the finite set defined by 
hy =-:- = hy» = 0. (This analysis does not apply when X is defined by the zero 
polynomial, in which case X = A?.) 


. If k =R, the zero-set of 2? + y?” is {(0,0)} for all n =1,2,.... If k is algebraically 


closed, then as a consequence of the Nullstellensatz, V(f) = V(g) with f and g 
irreducible implies that f = cg for some constant c. (See Problem 3). 


Let k = Fo, and let J be the ideal of k[X] generated by f(X) = X?+ X +1. Since 
f is irreducible, J is a maximal ideal (Section 3.1, Problem 8), in particular, I is 
proper. But f(0) and f(1) are nonzero, so V(I) is empty, contradicting the weak 
Nullstellensatz. 


Section 8.5 


di, 


If « ¢ M, then the ideal generated by M and « is R, by maximality of MW. Thus there 
exists y © M and z € R such that y+ zz = 1. By hypothesis, zx, hence 2, is a unit. 
Take the contrapositive to conclude that every nonunit belongs to M. 


. Any additive subgroup of the cyclic additive group of Zp» must consist of multiples 


of some power of p, and it follows that every ideal is contained in (p), which must 
therefore be the unique maximal ideal. 


. No. Acan be nilpotent, that is, some power of A can be 0. The set will be multiplicative 


if A is invertible. 
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. S-'(gf) takes m/s to g(f(m))/s, as does S~tgS~1f. If f is the identity on M, then 


S~'f is the identity on S~!M. 


. By hypothesis, gf = 0,so S~'gS-!f = S~-tgf = S-10 =0. ThusimS~!f C ker S~1g. 


Conversely, let  € N,s € S, with x/s € ker S~+g. Then g(x)/s = 0/1, so for some 
t € S we have tg(x) = g(tx) = 0. Therefore tx € kerg = im f, so tx = f(y) for some 
y € M. We now have x/s = f(y)/st = (S~'f)(y/st) € im S~'f. 


. The set of nonuits is M = {f/g: g(a) £0, f(a) = 0}, which is an ideal. By (8.5.9), R 


is a local ring with maximal ideal M. 


. The sequence 0 — N — M — M/N — 0 is exact, so by Problem 5, 0 — Ng = 


Ms — (M/N)s — 0 is exact. (If f is one of the maps in the first sequence, the 
corresponding map in the second sequence is S~!f.) It follows from the definition of 
localization of a module that Ns < Mg, and by exactness of the second sequence we 
have (M/N)s = Ms/Ns, as desired. 


Section 8.6 


1. 


If z”” belongs to the intersection of the J;, then x belongs to each VJ;, so x € Mea Te 
Conversely, if x € nevi, let v™ € J;. If m is the maximum of the m,;, then 


er emis, sore /reiyh. 


. We are essentially setting X = Z = 0 in R, and this collapses R down to k[Y]. Formally, 


map f+Jtog+/J, where g consists of those terms in f that do not involve X or Z. 
Then R/P = k[Y], an integral domain. Therefore P is prime. 


3. (X+D(Y+ND=2Z7+1]¢€ P?, but X+1¢ P? andY+I¢VP2=P. 
4. P, is prime because R/P, ~ k{Y], an integral domain. P» is maximal by (8.3.1), so P? 


is P)-primary by (8.6.6). The radical of Q is P2, so by (8.6.5), Q is Po-primary. 


. The first assertion is that 


(X?, XY) = (X)N (X,Y)? = (X) Nn (X?, XY, Y?) 
and the second is 
OAV Se Yh 


In each case, the left side is contained in the right side by definition of the ideals 
involved. The inclusion from right to left follows because if f(X,Y)X = g(X,Y)Y? 
(or f(X,Y)X = g(X,Y)Y), then g(X,Y) must involve X and f(X,Y) must involve Y. 
Thus f(X,Y)X is a polynomial multiple of XY. 


. By (8.6.9), a proper ideal J can be expressed as the intersection of finitely many primary 


ideals Q;. If Q; is Pj-primary, then by Problem 1, 
T=VI=0/Q, = 8, 


. Since X° and Y” belong to In, we have X,Y € WIn, so (X,Y) C VWIy. By (8.3.1), 


(X,Y) is a maximal ideal. Since VJ, is proper (it does not contain 1), we have 
(X,Y) = VIn. By (8.6.5), I, is primary. 
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Section 8.7 


1. 


[(e.y+y') +G] —[(2,y) + G] -[(x,y/) + G] = 0 since (2, y+ y') — (z,y) -— (a, y') € G 
r{(x, y) + G] — [(raz, y) + G] =0 since r(z, y) — (rz, y) € G; the other cases are similar. 


. Let a and 6 be integers such that am+bn = 1. If x € Z, and y € Zp, then 


rOy=1(4#@y) =a(mz ®y) + W(a @ ny) = 0 since z @0=0@2z=0. 


. Let A be a torsion abelian group, that is, each element of A has finite order. If x € A 


and y € Q, then na = 0 for some positive integer n. Thus 1 @ y = n(x ® (y/n)) = 
nz ® (y/n) =0® (y/n) = 0. 


. We have h = g/h’ = g/gh and h’ = gh = gg/h’. But if P = T and f = h, then 
g gg g 


g = 1r makes the diagram commute, as does g’g. By the uniqueness requirement in 
the universal mapping property, we must have g’g = 17, and similarly gg’ = 17. Thus 
T and T” are isomorphic. 


.n®r=n(1®r)=18nr=180=0. 
_nZ@LZ, = ZZ, = Z, by (8.7.6), withn®x# > 182-2, and since x 40, n@a 


cannot be 0. 


. We have a bilinear map (f,g) — f ®g from Homr(M, M’) x Hompr(N,N’) to 


Homr(M@RN, M'@RN’), and the result follows from the universal mapping property 
of tensor products. 


. In terms of matrices, we are to prove that M,,(R) @ M,(R) = Minn(R). This follows 


because M;,(R) is a free R-module of rank ¢?. 


Section 8.8 


1. 


Yr (Yet yy) Yep) Up HY Ye Ye Up FY Yer Ue Up ty 
Yio Up + YL Yi Y5°** Yp- The left side, as well as the first and last terms on the 
right, are zero by definition of N. Thus yy +++ yas Yje' Yp = —Yr Up Yi Yps a8 
asserted. 


. If w is any permutation of {a,...,b}, then by Problem 1, 


Ln(a)*** &n(b) = (SEN T)La:++ Zp. 


The left side will be +2,---x», regardless of the particular permutation 7, and the 
result follows. 


. The multilinear map f induces a unique h: M®? — Q such that h(y; ®--- @ yp) = 


f(Yi,-++,Yp). Since f is alternating, the kernel of h contains N, so the existence and 
uniqueness of the map g follows from the factor theorem. 


. By Problem 3, there is a unique R-homomorphism g: A"M — R such that 


g(y1-++ Yn) = f(yi,---,Yn). In particular, g(a1-+- an) = f(a1,...,%) = 140. Thus 
Z1+++&_, 40. Ifr is any nonzero element of R, then g(ra1x2---%y)=f(rx1,..., Xn) =", 
$0 TX1+++XLyn #0. By Problem 2, {x1 ---x,} is a basis for A"M. 


. Fix the set of indices Jp and its complementary set Jo. If Say arxy = 0, x; € R, 


multiply both sides on the right by xj,. If [ A Ip, then wrx z, = 0 by definition of N. 
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Thus a,,¢1,tj, = £ay,%1°++%p = 0. By Problem 4, az, = 0. Since Jo is arbitrary, the 
result follows. 


. We have Ro C S by definition of S. Assume that R,, C S form =0,1,...,n—1, and 


let a€ Ry, (n > 0). Then a € I, so a = S>y_, civ; where (since x; € Rn, and R is the 
direct sum of the Rm) c; € Rn—n,. By induction hypothesis, c; € S, and since x; € S$ 
by definition of S, we have a € S, completing the induction. 


. The “if? part follows from Section 8.2, Problem 5, so assume R Noetherian. Since 


Ro = R/I, it follows that Ro is Noetherian. Since R is Noetherian, J is finitely 
generated, so by Problem 6, R = S, a finitely generated Ro-algebra. 


Section 9.1 


1. 


Assume R is simple, and let « € R,x #0. Then Rx coincides with R, so 1 € Rx. Thus 
there is an element y € R such that yx = 1. Similarly, there is an element z € R such 
that zy = 1. Therefore 


z=2l=zyxr=1lr=2, sorzy=zy=1 


and y is a two-sided inverse of x. Conversely, assume that FR is a division ring, and 
let x be a nonzero element of the left ideal J. If y is the inverse of x, then 1 = yz € J, 
so I= Rand R is simple. 


. I is proper because f(1) = a2 40, and R/I = Re by the first isomorphism theorem. 


. The “if” part follows from the correspondence theorem, so assume that M is simple. 


If 2 is a nonzero element of M, then M = Ra by simplicity. If J = kerf as in 
Problem 2, then M © R/T, and I is maximal by the correspondence theorem. 


. The “only if” part was done in Problem 3, so assume that M is not simple. Let N bea 


submodule of M with 0 < N < M. Ifa is anonzero element of N, then Ra < N <M, 
so x cannot generate M. 


. By Problem 3, a simple Z-module is isomorphic to Z/I, where I is a maximal ideal 


of Z. By Section 2.4, Problems 1 and 2, I = (p) where p is prime. 


. As in Problem 5, a simple F[X]|-module is isomorphic to F[X]/(f), where f is an 


irreducible polynomial in FX]. (See Section 3.1, Problem 8.) 


. If x is a nonzero element of V and y an arbitrary element of V, there is an endomor- 


phism f such that f(x) = y. Therefore V = (End; V)a. By Problem 4, V is a simple 
End; (V)-module. 


. By (4.7.4), every such short exact sequence splits iff for any submodule N < M, 


M ~N@®P, where the map N — M can be identified with inclusion and the map 
M — P can be identified with projection. In other words, every submodule of M is a 
direct summand. Equivalently, by (9.1.2), M is semisimple. 


Section 9.2 


1. 


Unfortunately, multiplication by r is not necessarily an R-endomorphism of M, since 
r(sx) = (rs)a, which need not equal s(rx) = (sr)a. 
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. Let x be a generator of M, and define f: R— M by f(r) = ra. By the first isomor- 


phism theorem, M = R/ann M. The result follows from the correspondence theorem. 


. Let M = Zp ®Z,y where p is prime. Then M is not asimple Z-module, but ann M = pZ 


is a maximal ideal of Z. 


. The computation given in the statement of the problem shows that (1,0) is a generator 


of V, hence V is cyclic. But N = {(0,6): b € F} is a nontrivial proper submodule of 
V. (Note that T(0,b) = (0,0) € N.) Therefore V is not simple. 


. Since F is algebraically closed, f has an eigenvalue A € F’. Thus the kernel of f — AI 


is not zero, so it must be all of M. Therefore f(m) = Am for all m € M. 


.Ifrelands+J e€ R/T, thenr(s+J) =rs+TI. But if I is not a right ideal, we cannot 


guarantee that rs belongs to J. 


Section 9.3 


1. 


For each j = 1,...,n, there is a finite subset I[(j) of J such that 2; belongs to the 
direct sum of the M;,i € I(j). If J is the union of the I(j), then MC Qe; Mi C M, 
so M is the direct sum of the M;,i € J. 


. Each simple module M;,i = 1,...,n, is cyclic (Section 9.1, Problem 4), and therefore 


can be generated by a single element x;. Thus M is generated by x1,...,2n.- 


3. A left ideal is simple iff it is minimal, so the result follows from (9.1.2). 


. No. If it were, then by Section 9.1, Problem 5, Z would be a direct sum of cyclic groups 


of prime order. Thus each element of Z would have finite order, a contradiction. 


. By (4.6.4), every finite abelian group is the direct sum of various Z,, p prime. If p and 


q are distinct primes, then Z, © Zy =~ Zpq by the Chinese remainder theorem. Thus Z,, 
can be assembled from cyclic groups of prime order as long as no prime appears more 
than once in the factorization of n. (If Z, © Z, is part of the decomposition, the group 
cannot be cyclic.) Consequently, Z,, is semisimple if and only if n is square-free. 


. This follows from Section 9.1, Problem 8. (In the first case, B is semisimple by hy- 


pothesis, and in the second case A is semisimple. The degenerate case M = 0 can be 
handled directly.) 


. Conditions (a) and (b) are equivalent by (9.3.2) and the definition of semisimple ring. 


By Problem 6, (b) implies both (c) and (d). To show that (c) implies (b) and (d) 
implies (b), let M be a nonzero R-module, with N a submodule of M. By hypothesis, 
the sequence 0 — N — M — M/N — 0 splits. (By hypothesis, M/N is projective in 
the first case and N is injective in the second case.) By Section 9.1, Problem 8, M is 
semisimple. 


Section 9.4 


1. 


If there is an infinite descending sequence I; D Ig D--- of left ideals, we can proceed 
exactly as in (9.4.7) to reach a contradiction. 
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. Let I; be any nonzero left ideal. If J, is simple, we are finished. If not, there is 


a nonzero left ideal Iz such that I, D Ip. If we continue inductively, the Artinian 
hypothesis implies that the process must terminate in a simple left ideal ;. 


. By Problem 2, the ring R has a simple R-module M. The hypothesis that R has no 


nontrivial two-sided ideals implies that we can proceed exactly as in (9.4.6) to show 
that M is faithful. 


. If V is infinite-dimensional over D, then exactly as in (9.4.7), we find an infinite 


descending chain of left ideals, contradicting the assumption that R is Artinian. 


. By Problem 4, V is a finite-dimensional vector space over D, so we can reproduce the 


discussion preceding (9.4.7) to show that R = Endp(V) = M,,(D°). 


. The following is a composition series: 


0< M, <M, 8 Mo <::-<M6MoS:::-6M, =M. 


. By (9.1.2), M is a direct sum of simple modules. If the direct sum is infinite, then we 


can proceed as in Problem 6 to construct an infinite ascending (or descending) chain 
of submodules of M, contradicting the hypothesis that M is Artinian and Noetherian. 


Section 9.5 


1. 


If g € kerp, then gu = v for every v € V. Take v = lg to get glg =1lg,sog=1g 
and p is injective. 


- (gh)(v@) = v(g(A@)) and g(h(v(4))) = go(h@)) = v(g(h@)). Also, le(v(i)) = 


v(la(t)) = v(%). 


- We have g(v(1)) = v(4), g(v(2)) = v(2), g(v(3)) = v1), g(v(4)) = v(3), so 


. We have guy = v2, gv2 = v3 = —V1 — V2; hvy = v1, hve = v3 = —v, — v2. Thus 


i=[) a). 


Sli eal 


. We have v; = e; and vg = te, + 5 V8e9. Thus 


1 —3 1 3v3 
eS 1. eS 
0 3v3 0 3Vv3 
and 
/ —1 = 2 : 
[9] =P" [g)P = 
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. Check by direct computation that the matrices A and B satisfy the defining relations 


of Dg: At = I, B? = I, AB = BA™!. (See Section 5.8.) 


. Yes. Again, check by direct computation that the matrices A’BJ, i = 0,1,2,3, 7 = 0,1, 


are distinct. Thus if g € Dg and p(g) = J, then g is the identity element of Dg. 


Section 9.6 


1 


Let W be the one-dimensional subspace spanned by v,+v2+v3. Since any permutation 
in S3 permutes the v;, v € W implies gu € W. 


. Multiplying [a”] by aI and A we have a™v, = v; and a’vg = rv, + ve. Since W is 


spanned by vj, it is closed under the action of G and is therefore a kG-submodule. 


_ If 


then «+ ry = cx and y=cy. If y #0 then c= 1, so ry, hence y, must be 0. Thus 
c = 1 and z is arbitrary, so that any one-dimensional kG-submodule must coincide 
with W. 


. If V =W OU, where U is a kG-submodule, then U must be one-dimensional. By 


Problem 3, W = U, and since W £ 0, this is impossible. 


. If M is semisimple and either Noetherian or Artinian, then M is the direct sum of 


finitely many simple modules. These simple modules are the factors of a composition 
series, and the result follows from (7.5.12). 


. Let e be the natural projection on M,. Then e is an idempotent, e 4 0 since M, 4 0, 


and e #1 since e= 0 on Mz and M2 £0. 


. Let e be a nontrivial idempotent, and define e; = e, eg = 1—e. By direct computation, 


e, and eg are nontrivial idempotents that are orthogonal. Take M, = e1(M), Mz = 
e2(M). Then M, and Mg are nonzero submodules with M = M, + Mp. To show that 
the sum is direct, let z = eyx = egy € M,N Mo, with x,y € M. Then e,z = ejegy = 0, 
and similarly e2z = 0. Thus z = lz =e,z + e2z2 = 0. 


Section 9.7 


de, 


If M = Ra, define f: R — M by f(r) = ra. By the first isomorphism theorem, 
M & R/ker f. Moreover, ker f = ann(M). Conversely, R/I is cyclic since it is 
generated by 1+ J. 


. If N is a maximal submodule of M, then N is the kernel of the canonical map of M 


onto the simple module M/N. Conversely, if f: 4 — S, S simple, then f(M) is 0 
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or S,so f =O or S = M/ker f. Thus ker f is either M or a maximal submodule of M. 
The intersection of all kernels therefore coincides with the intersection of all maximal 
submodules. [If there are no maximal submodules, then the intersection of all kernels 
is M.] 


. By the correspondence theorem, the intersection of all maximal left ideals of R/T is 0. 


This follows because the intersection of all maximal left ideals of R containing J is 
the intersection of all maximal left ideals of R. [Note that J(R) is contained in every 
maximal left ideal, and I = J(R).| 


. Let g be an R-module homomorphism from N to the simple R-module S. Then 


gf: M — S, so by Problem 2, J(M) C ker(gf). But then f(J(M)) C ker g. Take the 
intersection over all g to get f(J(M)) C J(N). 


. Suppose a € J(R). If 1+ab is a nonunit, then it belongs to some maximal ideal M. But 


then a belongs to M as well, and therefore so does ab. Thus 1 € M, a contradiction. 
Now assume a ¢ J(R), so that for some maximal ideal M, a ¢ M. By maximality, 
M+ Ra = R,so1=2+ra for some x € M andr € R. Since x belongs to M, it 
cannot be a unit, so if we set b = —r, it follows that 1+ ab is a nonunit. 


. By the correspondence theorem, there is a bijection, given by 7(A) = A/N, between 


maximal submodules of M containing N and maximal submodules of M/N. Since 
N < J(M) by hypothesis, a maximal submodule of M containing N is the same 
thing as a maximal submodule of M. Thus J(M) corresponds to J(M/N), that is, 
w(J(M)) = J(M/N). Since p(J(M)) = J(M)/N, the result follows. 


Section 9.8 


1. 


a‘ € (a‘t'), so there exists b € R such that a’ = ba't+. Since R is an integral domain 
we have 1 = ba. 


. Let a be a nonzero element of the Artinian integral domain R. The sequence (a) D 


(a?) D ... stabilizes, so for some t we have (a) = (a‘t!). By Problem 1, a has an 
inverse, proving that R is a field. 


. If P is a prime ideal of R, then R/P is an Artinian integral domain by (7.5.7) and 


(2.4.5). By Problem 2, R/P is a field, so by (2.4.3), P is maximal. 


. We have POI € S and PNI C I, so by minimality of J, POI = I. Thus 


PIT=N"_1]j, so by (7.6.2), P 2 I; for some j. But P and J; are maximal ideals, 
hence P = Ij. 


_Ifz € F, with z=a+y,2 6M, y €N, define h: F — M/JM ®N/JIN by 


h(z) = (a@+JM)+(y+JN). Then h is an epimorphism with kernel JM+JN = JF, 
and the result follows from the first isomorphism theorem. 


. By (9.8.5), M/JM is an n-dimensional vector space over the residue field k. Since F, 


hence F'/. JF’, is generated by n elements, F'/JF has dimension at most n over k. 
Thus N/JN must have dimension zero, and the result follows. 


. Multiplication of an element of J on the right by a polynomial f(X,Y) amounts to 


multiplication by the constant term of f. Thus J is finitely generated as a right R- 
module iff it is finitely generated as an abelian group. This is a contradiction, because 


10. 
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as an abelian group, 


T= OS ZX°Y. 


. By the Hilbert basis theorem, Z[X] is a Noetherian ring and therefore a Noetherian 


left Z[X]-module. The isomorphic copy Z[X]Y is also a Noetherian left Z[X]-module, 
hence so is R, by (7.5.8). A left ideal of R that is finitely generated as a Z[X]-module 
is finitely generated as an R-module, so R is left-Noetherian. 


. The set {%,...,%,} spans V, and therefore contains a basis. If the containment is 
proper, then by (9.8.5) part (ii), {v1,...,%n} cannot be a minimal generating set, a 
contradiction. 


In vector-matrix notation we have y = Az and therefore 7 = Az, where Qj =aiyjtJ. 
By Problem 9, % and 7 are bases, so that det A 4 0. But under the canonical map of 
R onto k, det A maps to det A, and therefore det A cannot belong to the kernel of 
the map, namely J. But by (8.5.9), J is the ideal of nonunits of R, so det A is a unit. 


Section 10.1 


. This is exactly the same as the proof for groups in (1.1.1). 
. Any ring homomorphism on Q is determined by its values on Z (write m = (m/n)n 


and apply the homomorphism g to get g(m/n) = g(m)/g(n). Thus if gi = hi, then g 
coincides with h on Z, so g = h, proving that 7 is epic. 


. Let AZB be a shorthand notation for the composition of the unique morphism from 


A to the zero object Z, followed by the unique morphism from Z to B. If Z’ is another 
zero object, then AZB = (AZ'Z)B = A(Z'ZB) = A(Z'B) = AZ'B, as claimed. 


. Ifis = it, then fis = fit =0, so is(= it) = ih where h is unique. Thus s and ¢ must 


coincide with h, hence z is monic. 


. A kernel of a monic f: A — B is 0, realized as the zero map from a zero object Z 


to A. For f0 = 0, and if fg = 0, then fg = f0; since f is monic, g = 0. But then g 
can be factored through 0z,4. Similarly, a cokernel of the epic f: A — B is the zero 
map from B to a zero object. 


. If j = th and i = jh’, then i = ihh’, so by uniqueness in part (2) of the definition of 


kernel (applied when g = 7), hh’, and similarly h’h, must be the identity. 


. Define h: kK — A by h(x) = 1 for all x. Since K is nontrivial, h cannot be injective, 


so that g # h. But fg = fh, since both maps take everything in K to the identity 
of B. 


. The kernel of a ring homomorphism is an ideal, but not a subring (since it does not 


contains the multiplicative identity). 


. Let f: A— B bea noninjective ring homomorphism. Let C be the set of pairs (2, y) 


in the direct product A x A such that f(x) = f(y). Since f is not injective, there is 
an element (a, y) of C with « 4 y. Thus if D = {(z,2): « € A}, then DCC. If g is 
the projection of A x A on the first coordinate, and h is the projection on the second 
coordinate, then f(g(x,y)) = f(x) and f(h(a,y)) = f(y), so fg = fh on the ring C. 
But g and h disagree on the nonempty set C \ D, so f is not monic. 
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10. Let g be the canonical map of N onto N/f(M), and let h: N — N/f(M) be identically 
zero. Since gf sends everything to 0, we have gf = hf with g 4 h. Thus f is not epic. 


Section 10.2 


1. Suppose that y is the product of the x;. By definition of product, if fj: « — x; for 
all i, there is a unique f: x — y such that p;f = f;. Since p;: y — xj, we have y < 2;. 
Moreover, if « < x; for all i, then x < y. Therefore y is a greatest lower bound of 
the Xj. 

2. No. For example, consider the usual ordering on the integers. 

3. By duality, a coproduct of the x;, if it exists, is a least upper bound. 


4. If x has order r and y has order s, then rs(a + y) = s(rz) + r(sy) = 0. Thus the sum 
of two elements of finite order also has finite order, and the result follows. 

5. The key point is that if f is a homomorphism of a torsion abelian group S, then f (5) 
is a also torsion [since nf(«) = f(na)]. Thus in diagram (1) with A = [| A;, we have 
f(S) C T(A). Since J] A; is the product in the category of abelian groups, it follows 
that T(A) satisfies the universal mapping property and is therefore the product in the 
category of torsion abelian groups. 


6. Given homomorphisms f;: G; — H, we must lift the f; to a homomorphism from 
the free product to H. This is done via f(a1---an) = fi(ai)---fn(G@n). If i; is the 
inclusion map from G'; to *;G;, then f(i;(a;)) = f(a;) = f;(a;), as required. 

7. We have p;f = f;, where f;: G — C;. The f; can be chosen to be surjective (e.g., take 
G to be the direct product of the C;), and it follows that the p; are surjective. 


8. Since f: C, — C, we have f(a1) = na for some positive integer n. Thus 


a1 = fi(a1) = pi f(ai) = pi(na) = npi(a) = nai; 
0 = f2(ai) = pof (ai) = po(na) = np2(a) = nag. 


9. By Problem 8, the order of C; divides n — 1, and the order of Cy divides n. There are 
many choices of C; and C2 for which this is impossible. For example, let C; and C2 
be nontrivial p-groups for a fixed prime p. 


Section 10.3 


1. If f:¢7— y, then Ff: Fx — Fy. By definition of the category of preordered sets, this 
statement is equivalent to 7 < y => Fa < Fy. Thus functors are order-preserving 
maps. 

2. F must take the morphism associated with ry to the composition of the morphism 
associated with Fx and the morphism associated with Fy. In other words, F(ay) = 
F(a) F(y), that is, F is a homomorphism. 


3. If 6 € X*, then (gf)*(8) = Bgf and f*g*(8) = f* (G9) = Bof. 
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. To verify the functorial property, note that 


and 
gp a g** (v** f*) ss Cian ae ae 


Thus (gf)** = g** f**. If f is the identity, then so is f*, and consequently so is f**. 


. ftv(v) = Ff") = vf", and if 6 € W*, then (of*)(8) = v(f*8) = (f*8)(v) = BF). 


But tw f(v) = f(v) where f(0)(3) = Bf(). 


. Groups form a subcategory because every group is a monoid and every group homo- 


morphism is, in particular, a monoid homomorphism. The subcategory is full because 
every monoid homomorphism from one group to another is also a group homomor- 
phism. 


. (a) If two group homomorphisms are the same as set mappings, they are identical as 


homomorphisms as well. Thus the forgetful functor is faithful. But not every map 
of sets is a homomorphism, so the forgetful functor is not full. 


(b) Since (f,g) is mapped to f for arbitrary g, the projection functor is full but not 
faithful (except in some degenerate cases). 


Section 10.4 


1. 


If a homomorphism from Zz to Q takes 1 to z, then 0 =1+1—-2+2 = 22. But 0 
must be mapped to 0, so x = 0. 
1 


. A nonzero homomorphism can be constructed with 0 — 0,1 —> 5. Then 1+1— 


2 
$+4=1=0inQ/Z. 


. Since a trivial group cannot be mapped onto a nontrivial group, there is no way that 


Fg can be surjective. 


. Let f be a homomorphism from Q to Z. If r is any rational number and m is a positive 


integer, then 


so 


But if f(r) 40, we can choose m such that f(r)/m is not an integer, a contradiction. 
Therefore f = 0. 


. By Problem 4, Hom(Q, Z) = 0. But Hom(Z, Z) 4 0, so as in Problem 3, Gf cannot be 


surjective. 
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6. 


We have Zp ® Z = Zo and Z2 ®Q=0 
(6 =t6 “6s he =i 
nn Qn Qn In 


Thus the map Hf cannot be injective. 


. Since f, = Ff is injective, fa = 0 implies a = 0, so f is monic and hence injective. 


Since g. fx = 0, we have gfa = 0 for all a € Hom(M, A). Take M = A and a= 1, to 
conclude that gf = 0, so that im f C kerg. Finally, take M = kerg anda: M— B 
the inclusion map. Then g,a = ga = 0, so a € kerg, =imf,. Thus a= f(@ for some 
83 € Hom(M, A). Thus kerg = M =ima Cimf. 


. If (3) is exact for all possible R-modules N, then (1) is exact. This is dual to the result 


of Problem 7, and the proof amounts to interchanging injective and surjective, monic 
and epic, inclusion map and canonical map, kernel and cokernel. 


Section 10.5 


1. 


If « € P, then f(x) can be expressed as 5°, t;e; (a finite sum), and we define f;(x) = t; 
and x; = m(e;). Then 


r= n(f(z)) = OD tei) = Ds tin(ei) = 1» fi(x) xi. 


- W(F(2)) = Ly Ala)r(er:) = i fil@)ai = @. 


. By Problem 2, the exact sequence 0 — kera — F > P — 0 (with 7: F — P) splits, 


and therefore P is a direct summand of the free module F and hence projective. 


. Since R” is free, the “if” part follows from (10.5.3), part (4). If P is projective, then 


by the proof of (3) implies (4) in (10.5.3), P is a direct summand of a free module of 
rank n. [The free module can be taken to have a basis whose size is the same as that 
of a set of generators for P.] 


. This follows from Problem 1 with F = R”. 


. If P is projective and isomorphic to M/N, we have an exact sequence 0 — N — 


M — P — 0. Since P is projective, the sequence splits by (10.5.3), part (3), so P is 
isomorphic to a direct summand of M. Conversely, assume that P is a direct summand 
of every module of which it is a quotient. Since P is a quotient of a free module F, it 
follows that P is a direct summand of F. By (10.5.3) part (4), P is projective. 


. In the diagram above (10.5.1), take M = R; © Ro, with R; = Ro = R. Take P = 


N= R&,, f =1n,, and let g be the natural projection of M on N. Then we can take 
h(r) =r+s,r€ Ri,s © Ro, where s is either 0 or r. By replacing M by an arbitrary 
direct sum of copies of R, we can produce two choices for the component of h(r) in each 
R;, i = 2,3,... (with the restriction that only finitely many components are nonzero). 
Thus there will be infinitely many possible choices for h altogether. 
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Section 10.6 


1. f’g(a) = (g(a),0) + W and g/f(a) = (0, f(a)) + W, with (g(a),0) — (0, f(a)) = 
(g(a), -f(@)) © W. Thus f’g = of. 
2. If (b,c) € W, then b = g(a),c = —f(a) for some a € A. Therefore 


g'"(c) + f(b) = —9" F(a) + f" g(a) = 0 


and hf is well-defined. 

3. hg’(c) = h((0,c) + W) = g"(c) +0 = g"(c) and hf’(b) = h((b,0) + W) = 0+ f"(b) = 
f"(0). 

4. hi((b,c) + W) = h'((0,c) + W + (6,0) + W) = h'(g’(e) + f'(0)) = h’g'(c) + A’ f'(b) = 
g'"(e) + f"(b) = A((b, c) + W). 

5. If f’(b) = 0, then by definition of f’, (b,0) € W, so for some a € A we have b = g(a) 
and f(a) = 0. Since f is injective, a = 0, hence b = 0 and f" is injective. 

6. If b € B,c € C, then surjectivity of f gives c = f(a) for some a € A. Thus 
f(b + g(a)) = (6 + g(a),0) + W = (b+ gla),0) + W + (—g(a), f(a)) + W [note 
that (—g(a), f(a)) € W] = (0, f(a)) + W = (b,c) + W, proving that f’ is surjective. 

7. If (a,c) € D, then f(a) = g(c) and fg'(a,c) = f(a), gf'(a,¢) = g(e). Thus fg! = gf’. 

8. Ifa € E, then fg"(x) = gf" (x), so (g(x), f"(x)) € D. Take h(x) = (g"(2), f’(2)), 

which is the only possible choice that satisfies g’h = g” and f’h= f”. 

9. If (a,c) € D and f’(a,c) = 0, then c = 0, so f(a) = g(c) = 0. Since f is injective, 

a = 0. Consequently, (a,c) = 0 and f’ is injective. 

10. If c € C, then there exists a € A such that f(a) = g(c). Thus (a,c) € D and 

f' (a,c) =c, proving that f’ is surjective. 

11. Ifa,y € J, then f(ay) = af(y) and f(ay) = f(yx) = yf(x). Thus xf(y) = yf(a), 
and if « and y are nonzero, the result follows upon division by xy. 

12. We must extend f: [> Q toh: RQ. Let z be the common value f(x)/x, x € I, 
x #0. Define h(r) = rz, r € R. Then h is an R-homomorphism, and if « € I, « £0, 
then h(x) = xz = xf(«x)/x = f(x). Since h(0) = f(0) = 0, h is an extension of f and 
the result follows from (10.6.4). 


Section 10.7 
1. The only nonroutine verification is the check that sf € Homr(M, N): 


(sf)(rm) = f(rms) = rf(ms) = r[(sf)(m)]. 


2. (fr)(ms) = f(r(ms)) = f((rm)s) = f(rm)s = [(fr)(m)]s. 
3. (sf)(mr) = s(f(mr)) = s(f(m)r) = (sf(m)r = [(sf)(m)]r. 
4. (fr)(sm) = (f(sm))r = (sf(m))r = s(f(m)r) = s[(fr)(m)] 
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. In Problem 2, M and N are right S-modules, so we write f on the left: (fr)m = 


f(rm). In Problem 3, M and WN are right R-modules, and we write f on the left: 
(sf)m = s(fm). In Problem 4, M and WN are left S-modules, and we write f on the 


right: m(fr) = (mf)r. 


. Let y € M,re Rwith r 40. By hypothesis, « = ty € M, so we have x € M such 


that y = rz, proving M divisible. 


.Ifye M,r€ Rwithr F 0, we must define ty. Since M is divisible, there exists 7 € M 


such that y = rx, and we take +y = x. If x’ € M and y = ra’, then r(x — 2’) = 0, 
and since M is torsion-free, 7 = x’. Thus x is unique and scalar multiplication is 
well-defined. 


. Let f be a nonzero R-homomorphism from Q to R. Then f(u) = 1 for some u € Q. [If 


f(x) =r £0, then rf(a/r) = f(ra/r) = f(x) =7, so we can take u = x/r.| Now if s 
is a nonzero element of R, then sf(u/s) = f(su/s) = f(u) = 1, so f(u/s) is an inverse 
of s. Consequently, R is a field, contradicting the hypothesis. 


Section 10.8 


1. 


Let A = R[X] where R is any commutative ring. As an R-algebra, A is generated 
by X, but A is not finitely generated as an R-module since it contains polynomials of 
arbitrarily high degree. 


. The bilinear map determined by (X',Y’) — X*Y/ induces an R-homomorphism of 


R[X] @pr R[Y] onto R[X,Y], with inverse determined by X*Y? — X'@Y!. 


. Abbreviate X1,...,X, by X and Yj,...,Y¥m by Y. Let A be a homomorphic image 


of R[X] under f, and B a homomorphic image of R[Y] under g. Then A@r Bisa 
homomorphic image of R[X] ® R[Y](@ R[X, Y] by Problem 2) under f ® g. 


. If f: A — Bis an injective R-module homomorphism, then by hypothesis, (1@f): S®@r 


A— S @p B is injective. Also by hypothesis, 
(L@(1@f)): M@sSSrA>MBsS@RB 


is injective. Since M @s S = M, the result follows. 


. Let f: A — B be injective. Since A®s S = A and B@g S & B, it follows from the 


hypothesis that (f @1): A@s(S@rM)— B@s(S @rRM) is injective. Thus S$ @p M 
is a flat S-module. 


. a is derived from the bilinear map S~!R x M — S~!M given by (r/s,x) > ra/s. We 


must also show that / is well-defined. If x/s = y/t, then there exists u € S such that 
utz = usy. Thus 


1 1 1 
—~@xr%=—@L=— Oulxzx= — Ousy= By 
sut sut t 


as required. By construction, a and @ are inverses of each other and yield the desired 
isomorphism. 


. We must show that S~!R @pR — is an exact functor. But in view of Problem 6, an 


equivalent statement is the localization functor S~1 is exact, and this has already been 
proved in Section 8.5, Problem 5. 
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Section 10.9 


1. 


The proof of (10.9.4) uses the fact that we are working in the category of modules. To 
simply say “duality” and reverse all the arrows, we would need an argument that did 
not depend on the particular category. 


. Let N be the direct limit of the N;. The direct system {Nj;, h(i,7)} induces a direct 


system {M@ N;,1@h(i, 7)}. Compatibility in the new system reduces to compatibility 
in the old system; tensoring with 1 is harmless. Since compatible maps f;: N; — B 
can be lifted to f: N — B, it follows that compatible maps g;: M ® N; — B can be 
lifted to g: M@N — B. Thus M ® N satisfies the universal mapping property for 
{M ® Nj}. 


. The direct limit is A = US2, An, with a,: A, — A the inclusion map 


. Each R-homomorphism f from the direct sum of the A; to B induces an 


R-homomorphism f;: 4; > B. [fj is the injection of A; into the direct sum, fol- 
lowed by f]. Take a(f) = (fi,i € 1) € [];Homr(A;, B). Conversely, given such a 
family (fi,i € I), the f; can be lifted uniquely to G(f;,i € I) = f. Since a and £ are 
inverse R-homomorphisms, the result follows. 


_If f: A — JJ, Bi, define a(f) = (pif,i € I) € [],Homp(A, B;), where p; is the 


projection of the direct product onto the i*” factor. Conversely, given (g;,i € I), where 
gi: A— B,, the g; can be lifted to a unique g: A — [J], B; such that pig = g; for all i. 
If we take G(g;,i € I) = g, then a and @ are inverse R-homomorphisms, and the result 
follows. 


. There is a free module F such that F = M @ M’, and since F is torsion-free, so is M. 


Since M is injective, it is divisible, so by Problem 7 of Section 10.7, M is a vector 
space over the quotient field Q, hence a direct sum of copies of @. Therefore, using 
Problem 5 above and Problem 8 of Section 10.7, 


Homar(M, R) = Homp(SQ, R) ~ | [ Hom(Q, R) = 0. 


. By Problem 7, Homr(M, R) = 0. Let M be a direct summand of the free module F’ 


with basis {r;,i € I}. If a is a nonzero element of M, then x has some nonzero 
coordinate with respect to the basis, say coordinate j. If p; is the projection of F' on 
coordinate j (the j*” copy of R), then p; restricted to M is a nonzero R-homomorphism 
from M to R. (Note that « does not belong to the kernel of p;.) Thus the assumption 
that M # 0 leads to a contradiction. 
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